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ABSTRACT 


Little  is  knoun  of  the  ear’s  phase  sensitivity  limits 
outside  the  well  knoun  fact  that  excessive  dispersion  in  speech 
signals  results  in  chirp-like  sound  quality.  The  limits  of  phase 
sensitivity  may  be  determined  in  reference  to  the  parameters  of  an 
idealized  dispersion  sines  phase  shifts  are  nothing  mors  than  delays 
In  the  short-time  spectra  of  a signal.  A number  of  discrimination 
tests  uers  conducted  uhich  contrasted  standard  stimuli  with  phase 
modified  variants  of  the  stimuli  to  determine  the  limits  of  phase 
sensitivity.  The  Idealized  dispersion  uas  that  of  delaying  a band 
of  frequency  components  such  that  di ecr iminabi I i ty  could  be  measured 
as  functions  of  center  frequency,  banduidth  and  delay.  The  stimuli 
used  in  the  tests  included  an  impulse,  phonemes,  words,  and  a 
sentence. 

Sensitivity  to  dispersion  in  impulses  uas  shown  to  be 
dependent  on  Intensity,  center  frequency,  banduidth,  and  delay. 
Olscrimiritlon  scores  half  way  between  chance  and  perfect 
performance  uers  achieved  for  dispersed  impulses  uith  delayed  bands 
of  frequency  components  centered  between  2S0  and  500  Hertz  and 

This  report  reproduces  a dissertation  of  the  same  title 
submitted  to  the  Department  of  Communication,  Univereity  of  Utah,  in 
partial  fulfillment  of  ths  requirements  for  the  degree  of  Doctor  of 
Phi loeophy. 
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delayed  between  0.0625  and  0.125  milliseconds.  Sensation  level  of 
the  stimulus  pairs  was  80  dB.  The  effect  of  bandwidth  on 
dl  ecr Iminabi I i ty  appeares  to  reach  a plateau  at  100  Hertz,  which  is 
to  say  that  bandwidths  of  interest  are  less  than  100  Hertz. 

The  neutral  vowel,  the  plosive  /t/,  and  the  fricative  /f/ 
were  used  as  stimuli  in  experiments  in  which  phonemes  served  ae  the 
etandard  stimuli.  The  stimulus  pairs  were  presented  at  levele 
approximating  normal  conversational  level  speech,  i.e.,  70,  32,  and 
35  dB  Saneatlon  Level  respectively.  Dl ecr Iminabi 1 1 ty  was  obeerved 
to  be  strongly  dependent  on  bandwidth  and  delay.  A very  slight,  if 
any,  decrease  in  di scr iminabi I i ty  was  observed  with  Increased  center 
frequency.  Rank  ordering  of  the  phonemes  in  decreasing  sensitivity 
to  dispersion  is  plosive,  vouel,  and  fricative.  For  bandwidths  of 
approximately  100  to  400  Hertz,  the  plosive  requires  4 to  8,  the 
vowel  8 to  16,  and  the  fricative  16  to  32  milliseconds  of  delay. 
For  wider  bandwidths,  the  respective  ranges  are  2 to  4,  2 to  4,  and 
4 to  8 milliseconds.  More  explicitly,  for  narrower  bandwidths,  the 
plosive  requires  4 to  8 milliseconds  for  100  to  200  Hertz  bandwidth, 
the  vowel  8 to  16  milliseconds  for  100  to  400  Hertz  bandwidth,  and 
the  fricative  16  to  32  milliseconds  for  100  to  400  Hertz  bandwidth. 
For  wider  bandwidths  the  plosive  requires  2 to  4 milliseconds  for 
200  to  400  Hertz  bandwidth,  the  vowel  2 to  4 milliseconds  for  200  to 
300  Hertz  bandwidth,  and  the  fricative  4 to  8 milliseconds  for  400 
to  800  Hertz  bandwidth.  Less  delay  is  required  for  greater 


x 


bandwidth  since  dl scrlminabi I i ty  is  directly  dependent  on  both 
independent  varlablee. 

Dl ecr Imlnabl 1 1 ty  scores  obtained  from  teete  using  words  and 
a aantanca  as  standard  stimuli  agree  with  the  scores  obtained  from 
teats  ualng  phonemes.  Dl scrlminabi I i ty  half  way  between  chance  and 
perfect  performance  was  obtained  for  the  dlspereed  eentence  in  which, 
a 400  Hertz  band  of  components  centered  at  500  Hertz  wae  delayed 
between  4 and  8 milliseconds.  The  score  agreee  with  that  for  a 
vowel  which  Implies  that  cues  to  dispersed  sound  quality  in 
continuous  speech  arises  from  vowel  dispersions. 

The  results  of  the  study  indicates  that  speech  proceeeing 
eystems  introducing  no  more  than  a few  milliseconds  of  dispersion 
will  cause  little  detriment  to  the  speech  quality.  Thie  cone  I ua  ion 
la  significant  Inasmuch  as  it  Is  difficult,  if  not  Impoeslble,  to 
determine  the  phase  of  many  speech  processing  eystems. 


CHAPTER  I 


INTRODUCTION 

Phase  Sensitivity  and  Psychophysics 

Early  Investigators  of  hearing  science  had  long  understood 
that  the  pitch  and  loudness  of  a sinusoidal  sound  are  correlated 
respectively  ulth  the  frequency  and  Intensity  of  the  sound  source. 
The  physical  correlates  of  sound  quality,  however,  were  not  ae  well 
understood.  Helmholtz  (1863)  showed  that  the  only  attribute  of  a 
periodic  sound  to  which  quality  could  be  correlated  was  the  harmonic 
composition  of  the  sound. 

Helmholtz  satisfied  himsel f that  the  correlates  of  most 
Importance  to  sound  quality  were  the  period  and  amplitude  of  the 
constituent  frequency  components  and  that  phase  was  of  little 
consequence.  Research  since  the  time  of  Helmholtz  has  shown  that 
the  ear  la  undoubtedly  sensitive  to  phase.  The  full  nature  of  phase 
sensitivity,  however,  remains  incompletely  understood 
notwl thstandlng  the  importance  of  understanding  phase  perception  for 
both  theoretical  and  practical  reasons. 

The  theoretical  study  of  psychophysics  as  it  pertains  to 
phase  sensitivity,  or  any  other  aspect  of  perception,  as  a facet  of 
man's  communicative  system  is  justified  in  that  man  himeelf  is  an 
Interesting  subject  of  study.  Ae  a consequence  of  such  interest,  a 
goal  of  the  current  study  is  to  provide  a broader  theoretical! 
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Insight  Into  the  nature  of  phase  sensitivity. 

The  Role  of  Psychophysics  in  the  Oesign 
of  Speech  Processing  Systems 

The  study  of  phase  sensitivity  is  justified  tor  practical 
reasons  because  of  the  role  played  by  the  ear  as  the  final  I Ink  in 
speech  processing  systems.  Speech  processing  systems  might  Include 
those  for  coding  and  decoding,  transml ssion,  storage  and  retrieval, 
restoration,  enhancement,  etc.  Restoration  systems  would  be 
employed  to  retrieve  signals  from  excessive  noise  or  other 
contaminations.  Signal  enhancement  processing  could  be  Included  In 
the  design  of  hearing  aids  and  communication  systems. 

The  design  of  speech  processing  systems  must  be  approached 
with  an  awareness  of  the  capabilities  and  limitations  of  all  the 
individual  components,  including  the  ear,  which  constitute  the 
system.  For  example,  knowing  that  the  average  ear  is  capable  of 
correctly  interpreting  words  transmitted  through  a transmission 
system  of  only  3000  Hertz  bandwidth  is  extremely  important  in  the 
establishment  of  reliable  and  low  cost  commercial  telephony. 
Knowledge  of  the  ear’s  capabilities  dictates  a greater  transmission 
bandwidth  and  lower  permissible  distortion  levels  for  the  fidelity 
transmission  of  music  in  home  entertainment  systems. 

Because  of  the  real  need  for  specific  psychophysical  data 
applicable  to  speech  processing  system  design,  the  major  goal  of  the 
current  study  Is  to  provide  more  pertinent  data  pertaining  to  phase 
sensl tlvl ty. 
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Computer  Usage  in  Psychophysical  Research 

In  his  review  of  hearing  .science,  Boring  (1938)  noted  that 
there  uere  feu  facts  knoun  concerning  hearing  uhen  Helmholtz  urote 
hie  Lehre  *gn  dsn  Tonempf i ndunaen  in  18G3.  Most  of  the  facte  uere 
not  very  old.  Although  the  science  of  sound  may  be  dated  back  to 
Pythagoras,  Boring  noted  that  it  uas  not  until  the  advent  of  the 
electronic  vacuum  tube  by  de  Forest  in  1907  that  the  science  of 
sound  became  more  a field  of  success  than  effort.  The  major 
contribution  of  the  vacuum  tube  uas  to  provide  accuracy  of 
measurement  in  psychophysical  experimentation.  The  adoption  of 
electronics  to  the  science  of  hearing  houever,  uas  not  fully 
realized  until  very  late  in  the  1920's. 

A neu  era  in  the  development  of  hearing  science  began  uith 
the  rise  of  modern  digital  computers  in  the  1950's.  The  pouer, 
precision,  and  versatility  of  the  digital  computer  holds  great 
promise  In  the  further  advancement  of  psychophysical  research.  The 
major  contribution  of  computers  ui  I I result  from  their  application 
to  the  solution  of  exceedingly  complex  problems.  Indeed,  because  of 
the  complexity  of  the  problems  that  can  nou  be  practically 
approached,  many  modern  developments  in  psychophysics  uould  be 
Impossible  uithout  the  aid  of  the  large  and  pouerful  digital 
processing  machines. 

Tha  digital  processing  of  signals  represents  a major  uee  of 
computers  In  psychophysical  research.  One  salient  advantage  of 
digital  signal  processing  Is  that  the  process  need  not  be  linear. 
Examples  of  such  research  include  the  pioneering  uork  of  Stockham 


4 


(1972,  1975)  In  uhich  he  uas  able  to  remove  the  convolutional 
effects  of  the  recording  horn  from  old  Caruso  sound  recordings  using 
the  principles  of  homomorphic  f 1 I terlna.  I.e.,  general  I zed  I Inear 
f I I ter  1 no  (Oppenheim  ai  2±* • 1968).  Miller  (1973)  refined  and 
digitally  implemented  homomorphic  vocoder  techniques  for  the 
restoration  of  old,  noisy  sound  recordings.  The  nonlinear 
processing  required  in  the  restoration  uas  made  practical  only  by 
the  pouer  of  the  digital  computer. 

Other  pouerfui  and  Interesting  applications  of  computers  In 
psychophysics  includs  simulation,  stimulus  preparation,  control  of 
experiments,  data  acquisition,  and  data  reduction.  As  an  example  of 
simulation,  Callahan  (1975),  using  a tuo-dlmenslonal  speech 
processing  algorithm,  has  developed  the  capability  of  modeling 
auditory  effects  such  as  recruitment,  fatigue,  adaptation, 
inhibition,  etc. 

A major  thrust  of  the  research  to  be  described  here  has  been 
to  further  develop  and  apply  digital  techniques  uhich  may  be 
generally  applicable  to  psychophysical  research.  It  is  important 
not  to  allou  one’s  thinking  to  be  overly  constrained  by  the 
limitations  of  current  computer  architecture  uhen  developing  digital 
techniques.  The  advance  of  computer  science  is  so  rapid  that  larger 
machines  become  available  before  the  potential  and  capacity  of 
machines  already  in  use  may  be  fully  explored  and  exhausted.  Much 
understanding  of  the  practical  application  of  digital  techniques  to 
future  psychophysical  research  may  be  lost  if  the  full  potential  of 
present  day  machines  is  not  fully  investigated.  The  study  of  the 
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computer  as  a basic  research  tool  is  justified  in  its  own  right 
inasmuch  as  the  proven  usefulness  of  digital  computers  has  made 
computer  application  to  psychophysical  research  as  basic  as 
psychometric  methods. 

Uniqueness  and  Significance  of  the  Study 

This  study  differs  from  previous  studies  concerning  the 
psychophysical  determination  of  phase  sensitivity  In  three 
fundamental  respects.  First,  the  philosophical  approach  to  the 
problem  is  different.  Previous  studies,  using  a variety  of  stimuli 
and  psychophysical  methods,  have  indirectly  attempted  to  "prove"  or 
"disprove"  Helmholtz’  phase  rule  by  demonstrating  new  "phase 
effects."  The  results  of  those  studies,  from  the  second  half  of  the 
nineteenth  century  to  date,  have  been  conflicting.  Earlier  failures 
In  demonstrating  phase  sensitivity  are  probably  due  to  equipment 
limitations.  Any  successful  demonstration  of  phase  sensitivity  was 
attributed  to  experimental  artifacts  as  discussed  by  Beasley 
(1930a),  One  must  be  Impressed  by  the  fact,  houever,  that  since  the 
second  quarter  of  the  current  century  almost  all  phase  sensitivity 
studies,  uithout  exception,  have  demonstrated  the  ear’s  sensitivity 
to  phase  modifications  of  one  type  or  another.  Notwithstanding  the 
success  of  the  later  experiments,  there  has  yet  to  be  a critical 
study  unifying  the  results  into  a comprehensive  theory  of  phase 
sens) tlvi ty. 

The  approach  taken  by  the  author  In  this  study  is  neither  to 
prove  or  disprove  the  phase  rule  or  to  demonstrate  a new  phase 
effect.  It  ulll  bo  clsar  from  the  review  of  literature  In  Chapter 
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III  that  the  ear  Is  sensitive  to  phase.  The  phase  effects 
demonstrated  by  earlier  investigators  have  generally  relied  on 
specialized  stimuli  not  normally  encountered  in  the  every  day 
acoustic  environment.  The  author,  accepting  the  sensitivity  of  the 
ear  to  phase,  proposes  to  measure  the  sensitivity  by  systematically 
modifying  the  phase  of  impulses  and  speech  sounds.  The  results  of 
the  study  may  be  more  pragmatic  than  many  of  the  other  studies 
discussed  in  the  review  of  literature  but  none  the  less  applicable 
to  theoretical  insights. 

The  second  respect  In  which  the  present  study  differs  from 
previous  ones  is  in  the  computer  instrumentation  of  the  study.  It 
Is  believed  by  the  author  that  the  computer  has  been  put  to  fairly 
sophisticated  use  during  the  course  of  the  Investigation  to  be 
discussed.  The  uses  to  which  the  computer  has  been  put  will  be 
diecussed  in  greater  detail  throughout  this  report.  It  ie  hoped 
that  the  discussion  will  be  of  profit  to  future  psychophysical 
experimentation  of  the  type  described  herein. 

The  third  respect  in  which  the  study  differs  ie  in  the 
further  development  of  the  computer  as  a basic  research  tool. 

The  significance  of  the  study  is  inextricably  involved  with 
it’s  uniqueness.  The  determination  of  phase  sensitivity  has  been 
ehoun  to  be  interesting  for  both  theoretical  and  practical 
coneiderat ions.  The  very  least  application  to  which  the  resulte  may 
be  applied  is  in  the  "worse  case"  design  of  signal  processing 
systems.  The  instrumentation  of  the  study  and  the  development  of 
digital  techniques  are  of  similar  significance. 
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In  summary,  a study  Is  to  bs  presented  pertaining  to  the 
phase  sensitivity  of  the  ear.  The  role  of  psychophysics  In  the 
design  of  speech  processing  systems  and  the  use  of  computers  in 
psychophysical  research  has  been  discussed.  The  present  etudy  is 
describe^  as  being  unique  because  of  Its  approach  to  the  phaea 
sensitivity  problem,  Its  instrumentation,  and  its  development  of 
digital  techniques. 


CHAPTER  II 


THE  PHASE  SENSITIVITY  QUESTION 

Acoustic  signals  mag  be  analyzed  by  Fourier's  methods  Into 
their  constituent  frequency  components.  Such  analysis  may  be 
effected  mathematically,  mechanically,  optically,  electrically,  and 
digitally.  That  the  ear  may  also  effect  frequency  analysis  after 
the  manner  of  Fourier  uas  first  stated  by  Ohm  in  1843.  Helmholtz 
(18G3)  verified  the  lau  and  as  a result  of  his  own  investigation 
added  the  generally  accepted  dictum  that  the  musical  quality  of  a 
musical  tone  Is  phase  independent.  The  ear  does  appear,  houever,  to 
be  sensitive  to  phase  modifications  of  certain  types.  Determination 
of  the  way  and  to  what  extent  the  ear  is  sensitive  to  phase  is 
termed  by  the  author  to  be  the  phase  9ensi tivi ty  quest  1 on.  There 
has  been  much  work  since  the  time  of  Helmholtz  concerned  with 
determining  all  the  varied  ramifications  of  the  phase  sensitivity 
question.  The  following  discussion  is  expedient  to  clarify  the 
phase  sensitivity  question  and  to  put  the  current  work  into  its 
proper  perspective. 

The  Time  and  Frequency  Domain  Representation 
of  Acoustic  Signals 

Acoustic  signals  are  the  spherical  propagation  of  acoustic 
energy  from  a given  source.  The  Information  content  of  such  signals 
Is  represented  as  atmospheric  pressure  variations  as  measured  at 
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eome  fixed  point  from  the  source.  A graphical  representat  Ion  in 
which  the  pressure  or  velocity  variations  are  plotted  as  a function 
of  time  Is  knoun  as  the  signal  waveform. 

Waveforms  may  be  described  in  terms  of  mathematical 
functions.  It  was  generally  known  before  the  time  of  Fourier*  that 
continuous  functions  could  be  represented  ae  the  sum  of  sinusoids  of 
various  periods,  amplitudes,  and  phase  relationships,  but  It 
remained  for  Fourier  to  demonstrate  that  discontinuous  waveforms 
could  also  be  so  represented. 

It  follows  from  the  preceding  that  a function  may  be 
represented  aB  either  a waveform  in  the  so  called  t i me  doma i n or  as 
the  weighted  sum  of  sinusoids  in  the  so  called  fraouancu  or  Four  I er 
domain.  Changing  from  one  representation  to  the  other  ie  termed  a 
transf ormat i ont  In  particular,  changing  from  the  time  domain  to  the 
frequency  domain  is  termed  the  Four i er  transformat i on.  the  Inverse 
process  being  termed  the  i n verse  Four i er  transformat  I on. 
Transformations  are  performed  in  order  to  exploit  the  properties  of 
one  domain  or  the  other.  Common  examples  of  transformat i ona I 
exploitation  are  found  in  statistics,  the  use  of  logarithms,  etc. 
The  particular  properties  of  the  Fourier  transformation  of  interest 
to  this  study  will  be  discussed  in  Chapter  IV,  but  the  specific 
purpose  of  the  present  introduction  Is  to  provide  a historical 


IJean  Baptiste  Joseph  Fourier  (17G8  - 1830),  a distinguished 
French  mathematician. 

2Cul  I lemln  (1983),  has  noted  that  Euler  and  contemporary 
mathematicians  felt  that  such  representation  was  Impossible  except 
for  continuous  functions.  Although  Fourier  did  not  offer  rigorous 
proof,  such  proof  was  demonstrated  by  Dirichlet  in  1837. 


10 


perspective  for  Ohm's  contribution  to  the  science  of  hearing. 

Ohm's  Law  of  Acoustics 

The  essence  of  Ohm’s  lau  of  acoustics  Is  that  the  ear 
performs  a Fourier  analysis  on  an  acoustic  signal  by  transforming 
the  signal  Into  sinusoidal  components  which  the  ear  individually 
perceives.  Ohm  (1843)  developed  the  lau  in  part  from  the  1841 
experimental  work  of  Seebeck,  who  Uightman  and  Green  (1974)  cite  as 
probably  being  the  first  to  conduct  a systematic  Investigation  of 
pitch  perception.  Through  a series  of  experiments  involving  the*  use 
of  a siren,  Seebeck  concluded  that  the  pitch  of  a sound  was  related 
to  the  periodicity  of  the  waveform  rather  than  the  spectral  energy 
at  the  reciprocal  of  the  periodicity  of  the  waveform,  l.e.,  the 
fundamental  frequency. 

Ohm  took  issue  with  Seebeck’ e conclusion  in  that  Ohm 
believed  that  a pitch  could  be  heard  only  if  the  acoustic  signal 
contained  power  at  that  frequency.  Ohm  demonstrated  by  use  of  the 
Fourier  transformation  that  Seebeck’ a waveform  did  in  fact  contain 
the  required  frequency  components,  but  at  low  intensities.  Seebeck, 
In  turn,  objected  to  Ohm’s  explanation  because  the  amplitude  of  the 
components  were  too  low  to  account  adequately  for  the  intensity  of 
the  pitch  which  could  be  heard.  The  controversy  was  tentatively 
resolved  twenty  years  later  by  Helmholtz,  who  poetulated  that 
nonl Inear  distortion  In  the  middle  ear  would  produce  distortion 
products  that  would  combine  with  the  incidental  signal  so  as  to  be 
analyzable  by  the  ear  into  component  frequencies  as  explained  by 
Ohm's  law. 
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Verification  of  Ohm’s  Law  of  Acoustics 

Helmholtz  (1863),  as  part  of  his  investigation  Into  the 

phgsics  and  psychophysiology  of  music,  had  occasion  to  determine 

whether  or  not  the  ear  analyzed  an  acoustic  signal  or  not  tp.  33), 

and  reasoned  that  inasmuch  as  the  ear  is  able  to  analyze  an  acoustic 

signal  produced  by  two  tuning  forKs  that  it  must  also  be  able  to  so 

analyze  a note  produced  by  a single  instrument  such  as  a flute  or 

organ  pipe.  Helmholtz  cited  Ohm  as  having  first  laid  down  the  rule 

by  which  the  ear  analyzes  acoustic  signals  and  according  to  hie 

understanding  of  the  rule,  stated  that 

Every  motion  of  the  air,  then,  which  corresponds  to  a 
composite  mass  of  musical  tones,  is,  according  to  Ohm's  law, 
capable  of  being  analyzed  Into  a sum  of  simple  pendular 
vibrations,  and  to  each  such  single  simple  vibration 
corresponds  a simple  tone,  sensible  to  the  ear,  and  having  a 
pitch  determined  by  the  periodic  time  of  the  correspond! ng 
motion  of  the  air. 

Having  so  restated  the  law,  Helmholtz  tp.  521  proceeded  to 
prove  It  by  the  expedient  of  shouing  that  the  ear  does  not  perceive 
frequency  components  when  they  are  not  part  of  an  acoustic  signal. 
Uhether  or  not  the  components  were  present  in  a signal  was 
determined  either  by  mathematical  calculation  or  by  "sympathetic 
resonance,"  i.e. , by  the  use  of  tuned  resonators  which  would  amplify 
the  given  component  In  the  signal  such  that  it  would  be  audible.  A 
elgnal  produced  by  a vibrating  string  was  used  because  the 

constituent  frequencies  of  the  sound  could  be  easily  changed  by  the 
manner  and  the  spot  in  which  the  string  was  excited,  and  because  of 
the  ease  In  which  the  theoretical  or  experimental  analysie  of  the 
sound  could  be  effected. 
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Helmholtz  used  the  principle  of  sympathetic  vibration  of 
other  strings  and  resonators  for  the  experimental  analysis  of  a 
vibrating  string.  He  also  used  the  more  direct  approach  of  touching 
the  etrlng  at  1 1 6 various  vibrational  nodes  and  observing  which 
components  disappeared.  The  reader  is  referred  directly  to 
Helmholtz  [p.  52)  for  a fuller  appreciation  of  the  simplicity  and 
elegance  In  which  the  frequency  component  composition  of  a vibrating 
etrlng  may  be  determined  simply  by  touching  the  string  at  its 
vibrational  nodes. 

By  the  expedient  of  damping  vibrating  strings  at  various 
points,  thereby  removing  the  corresponding  frequency  components,  and 
then  demonstrating  that  the  ear  could  no  longer  hear  that  particular 
frequency  component,  Helmholtz  demonstrated  that  the  ear  analyzed 
the  sound  Into  the  same  components  as  would  be  analyzed  by  physical 
analysis.  Notwithstanding  the  inability  of  the  ear  to  recognize 
among  the  strong  components  of  a signal  all  the  components 
detectable  with  the  resonators,  Helmholtz  concluded  his  proof  with 
the  statement  (p.  561 

The  ear  recognizes  without  resonators  the  simple  tones 
[sinusoidal  components]  which  the  resonators  greatly 
reinforce,  and  perceives  no  upper  partial  tone  Ci.e.,  other 
frequency  components]  which  the  resonator  does  not  indicate. 

To  verify  this  conclusion,  I performed  numerous  experiments, 
both  with  the  human  voice  and  the  harmonium,  and  they  all 
confirmed  it. 

Helmholtz  not  only  supported  Ohm’s  law  but  also  provided  a 
physiological  basis  for  Its  operation.  The  basilar  membrane  of  the 
middle  ear,  based  on  the  anatomical  discoveries  of  Corti  in  1851, 
was  postulated  to  be  composed  of  a series  of  transversely  stretched 
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fibers,  each  of  which  was  resonant  to  a different  frequency.  An 
acoustic  signal  vibrating  the  membrane  would  consequently  excite 
only  those  fibers  which  were  tuned  to  the  resonant  frequency 
components  In  the  signal.  Corresponding  individual  sensations  would 
follow  from  the  doctrine  of  specific  nerve  energies  stated  by  Muller 
In  1836. 

The  Sound  Quality  Question 

Helmholtz  was  interested  in  determining  why  different 
musical  Instruments  (Including  the  human  voice)  had  different  Bound 
qualities,  e.g.,  that  peculiar  property  of  sound  that  enables  a 
listener  to  recognize  its  source.  As  introduced  in  Chapter  I, 
Helmholtz  ehoued  that  the  only  attribute  of  a sound  to  which  quality 
could  be  correlated  was  the  shape  of  its  representative  waveform. 
Helmholtz  [p.  65)  admitted  that  the  reason  for  the  conclusion  was 
negative  but  proceeding  from  the  results  discussed  in  the  last 
section  concluded  that  musical  tones  of  the  Bame  quality  would 
always  be  composed  of  the  same  frequency  components  inasmuch  as  it 
Is  the  components  that  elicit  their  corresponding  sensations.  The 
natural  question  following  the  conclusion  is  to  what  extent  the 
difference  in  quality  can  be  explained  by  the  combination  of 
different  components  of  various  amplitudes. 

Helmholtz  was  able  to  demonstrate  through  a series  of 
experiments  that  sound  quality  was  correlated  with  the  weighted  sum 
of  frequency  components  constituting  the  sound.  Helmholtz  found  it 
Important  to  note  that  quality  is  not  to  be  confused  with  the 
peculiarities  of  how  the  sound  begins  or  ends.  Sounds  produced  by 
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musical  instruments  may  build  up  and  die  auay  at  different  rates, 
etc.  Helmholtz  further  noted  that  even  when  a musical  tone  Is 
uniformly  sustained  that  certain  noises  may  accompany  it  such  as  the 
hissing  of  air  in  wind  instruments,  the  rubbing  of  a violin  bow, 
etc.  In  light  of  these  considerations,  Helmholtz  Cp.  G7I  defined 
musical  qua  1 1 tu  as  follows. 

Uhen  ue  speak  in  what  follows  of  musical  qua  I i tu  of  tone,  we 
shall  disregard  these  peculiarities  of  beginning  and  ending, 
and  confine  our  attention  to  the  peculiarities  of  the 
musical  tone  which  continues  uniformly. 

The  Phase  Rule 

After  investigating  the  musical  quality  of  various  musical 
instrumsnts  and  the  human  voice,  Helmholtz  [p.  119]  turned  his 
attention  toward  determining  the  Importance  of  the  phase 
relationships  among  ths  constituent  frequency  compc'snte  to  Bound 
quality*  It  should  be  noted  that  the  sounds  under  investigation 
were  musical  tones,  l.s.,  sounds  which  are  periodic  as  opposed  to 
irregular  motions  of  the  air  such  as  noise.  Helmholtz  pursued  hie 
goal  by  using  various  synthesized  musical  tones  which  imitated 
vowels  to  determine  whether  or  not  a difference  in  quality  was 
detectable  when  the  phase  was  varied. 

A battery  of  electrically  driven  tuning  forks  was  used  In 
the  synthesis.  Phase  could  be  altered  by  bringing  the  reeonancs 
chamber  of  a given  fork  slightly  out  of  tune  with  the  fork.  Every 
phase  condition  was  possible  using  this  technique.  Placing  ths 
chambers  out  of  resonance  also  weakened  the  sound  of  the  fork  but 
euch  weakening  was  compensated  by  adjusting  the  distance  between  ths 
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other  forka  and  their  corresponding  resonators.  The  same  effect 
could  be  achieved  bg  slightly  mistuneing  the  forks,  e.g.,  by  adding 
drops  of  sealing  wax  to  the  tines.  In  such  cases,  the  actual  phases 
could  be  measured  by  viewing  Lissajous  figures  by  means  of  a 
vibration  microscope  tp.  126) . 

Helmholtz  experimented  with  many  tones  with  different  phases 
and  was  never  able  to  experience  any  difference  In  the  quality  of 
the  tone.  He  found  that  it  made  no  difference  whether  he  weakened 
the  constituent  frequency  components  by  detuning  their  resonant 
cavities,  thereby  shiftirfg  their  phase,  or  by  moving  the  resonators 
further  from  their  forks.  Helmholtz  tp.  126)  laid  down  the 
following  rule  as  a result  of  these  experiments! 

. . . the  quality  of  the  musical  portion  of  a compound  tone 
depends  solely  on  the  number  and  relative  strength  of  i 1 6 
partial  simple  tones,  and  in  no  respect  on  their  differences 
of  phase. 

Helmholtz  qualified  his  rule,  however,  with  the  following 
restriction  (p.  1271 i 

It  must  be  here  observed  that  we  are  speaking  only  of 
musical  quality  as  previously  defined.  When  the  musical 
tone  is  accompanied  by  unmusical  noises,  such  as  jarring, 
scratching,  soughing,  whizzing,  hissing,  these  motions  are 
either  not  to  be  considered  as  periodic  at  all,  or  else 
correspond  to  high  upper  partials,  of  nearly  the  same  pitch, 
which  consequently  form  strident  dissonances.  We  were  not 
able  to  embrace  these  in  our  experiments,  and  hence  we  must 
leave  it  for  the  present  doubtful  whether  in  such 
dissonating  tones  difference  of  phase  is  an  element  of 
importance.  Subsequent  theoretic  considerations  will  lead 
ue  to  suppose  that  it  really  is. 


The  Validity  of  Ohm’s  Law 

The  validity  of  Ohm's  law  has  been  discussed  elnce  Ite  flret 
publication.  It  becomes  apparent,  however,  in  reviewing  the 
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literature  concerning  the  lau  that  there  are  differencee  In  opinion 
aa  to  what  constitutes  the  lau  (See  Appendix  A],  It  ie  general  Ig 
accepted  that  the  lau  pertains  to  the  frequency  analyzing  ability  of 
the  ear.  Confusion  arises  as  to  uhether  or  not  Ohm  included 
statements  In  hie  lau  pertaining  to  phase  sensitivity  or  If  such 
etatements  uere  later  added  by  Helmholtz.  A careful  reading  of  Ohm 
and  Helmholtz  reveals  that  Ohm  did  not  conduct  any  experiments  on 
phase  or  make  any  statements  concerning  phase  sensitivity.  Such 
statements  uere  later  added  by  Helmholtz  as  a result  of  hie  own 
work.  Goldstein  (1967)  gives  the  interpretation  that  Helmholtz’ 
phase  rule  complements  Ohm’e  lau  in  that  the  spectral  components  one 
may  "hear  out"  due  to  the  Fourier  analysis  are  phase  Independent. 

This  report  Is  mainly  concerned  with  the  phase  eeneitivity 
question.  It  has  been  necessary,  houever,  to  consider  the  phase 
question  up  to  this  point  in  conjunction  with  Ohm’s  lau  for  two 
reasons.  The  first  reason  is  that  it  was  probably  Helmholtz  who  wae 
first  to  make  a definitive  statement  about  phase  sensitivity.  To 
the  extent  that  Helmholtz  demonstrated  the  correlation  betueen  the 
musical  quality  of  a tone  and  the  weighted  sum  of  Its  constituent 
frequency  components,  and  that  the  ear  analyzed  the  sound  according 
to  the  tenets  of  Ohm's  lau,  the  uork  of  Ohm  and  Helmholtz  are 
Inseparably  connected.  The  second  reason  for  the  Joint 
consideration  of  Ohm’s  analytic  lau  and  Helmholtz*  phase  rule  is  the 
previously  discussed  confounding  to  the  two. 

Exceptions  to  Ohm’s  lau  as  defined  in  its  restricted  sense 
are  not  directly  pertinent  to  the  thesis  under  considerat  on  but 
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because  of  the  long  time  intimacy  between  the  work  of  Ohm  and 
Helmholtz,  it  must  be  mentioned  for  completeness  that  there  appear 
to  be  minor  exceptions  to  Ohm’s  law. 

Richardson  (1927)  has  Indicated  that  Ohm’s  law  holds  fairly 
well  for  weak  sounds  and  that  "the  instances  where  it  seems  to  be 
untrue  can  be  explained  In  the  main  as  aural  illusions,  that  Is  to 
say,  that  their  cause  is  psychological."  Richardson  stated  that 
"this  is  not  a refutation  of  Ohm’s  law,  at  least  in  principle,  but 
may  simply  imply  the  intrusion  of  other  simple  tones  not  in  the 
external  sound,  into  the  quality  of  the  note  as  perceived  by  the 
ear."  Richardson  cited  subjective  combination  tones  as  an  example  of 
such  intrusions. 

Trimmer  and  Firestone  (1937)  noted  that  "these  empirical 
facts — the  validity  of  Ohm’s  law  at  low  amplitudes  and  the 
exceptions  at  higher  amplitudes — are  uell  established  . . . 
Stevens  and  Davis  (1938)  noted  in  reference  to  Ohm’s  law  of 
acoustics  that  ",  . . the  ear  Is  In  general  able  to  detect  the 
presence  of  component  frequencies  In  a sound-wave  and  to  Identify 
their  pitch  provided  they  are  not  too  numerous  or  too  faint." 

Monaural  Phase  Effects 

It  may  be  generally  said  that  the  ear  is  not  indifferent  to 

phase  if  differences  in  signal  quality  is  perceived  as  a result  of 

phase  modifications  within  the  signal  complex.  Such  observations  of 

quality  change  are  referred  to  in  the  literature  as  monaural  phase 

effects,  and  are  defined  by  Trimmer  and  Firestone  (1937)  as  follows! 

Suppose  an  observer  listens  to  a combination  of  n objective 
tones,  all  having  absolutely  fixed  frequencies,  amplitudes 


18 


and  phases.  Let  Po  represent  the  periodic  pressure  wave 
made  up  of  these  tones.  Let  the  observer  be  simultaneously 
presented  with  an  exploring  tone  Pe — that  is,  a tone  of 
which  the  frequency,  the  amplitude  and  the  phase  are  all 
ad justabl ei 

Pe  • A cos  (2nft  + 0) 

If,  for  given  Po,  It  is  possible  to  find  values  of  A and  of 
f such  that  the  ear  hears  changes  in  the  combined  sound  of 
Po  and  Pe  as  0 changes,  a (monaural)  phase  effect  is  said  to 
be  observed.  These  changes  in  the  heard  sound  may  be  of 
loudness,  pitch  or  quality. 

These  authors  stated  that  the  monaural  phase  effect  is  not  to  be 
confused  with  the  binaural  phase  effect  which  is  important  In  eound 
local Izatlon. 

It  is  possible  to  generalize  the  concept  above  to  cover 

Instances  In  which  the  "exploring  tone"  is  implicit  in  the  signal 

complex  rather  then  existing  as  an  explicit  entity  as  described  in 
the  definition  above. 

This  definition  of  a monaural  phase  effect  appears  to  be 

commonly  accepted  in  the  literature.  Stevens  and  Davie  (1938) 

tp.  2031  stated  that  "those  experiments  in  which  an  auxiliary  tone 

was  made  to  beat  with  an  aural  harmonic  prove  definitely  that  the 

phaee-relatlone  among  the  harmonic  components  of  a stimulus  are 

detectable,  for  otherwise  these  beats  could  not  occur,"  i.e.,  the 

percept  Ion* of  beats  is  a phase  effect.  It  is  not  clear  that  only  a 

single  mechanism  accounts  for  the  detectability  of  all  phase 

effects.  Helmholtz  Ip.  1271  apparently  did  not  intend  to  use  a 

definition  of  phase  sensitivity  as  broad  as  that  defined  by  monaural 

phase  effects,  for  In  reference  to  his  phase  rule,  he  wrotei 

An  apparent  exception  to  this  rule  must  here  be  mentioned. 

If  the  forks  B|?  and  b|?  are  not  perfectly  tuned  ae  Octavee, 
and  are  brought  into  vibration  by  rubbing  or  striking,  an 
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attentive  ear  will  observe  very  weak  beats  which  appear  like 
small  changes  in  the  strength  of  the  tone  and  its  quality. 
These  beats  are  certainly  connected  with  the  successive 
entrance  of  the  vibrating  forks  on  varying  difference  of 
phase.  Their  explanation  will  be  given  when  combinational 
tones  are  considered,  and  it  will  then  be  shewn  that  these 
slight  variations  of  quality  are  referable  to  changes  in  the 
strength  of  one  of  the  simple  tones. 

Beasley  (1931)  noted  that  Helmholtz  considered  "mueical 
quality"  to  mean  "vowel  quality"  and  that  Helmholtz  did  not  coneider 
relative  changes  in  loudness  to  be  variations  in  "musical  quality." 
As  a further  example,  perhaps,  of  not  wishing  to  confound 
discrimination  of  phase  shifts  and  discrimination  of  amplitude 
changes,  even  though  the  amplitude  changes  result  from  phase 
modifications,  Hansen  and  Madsen  (1974)  took  care  in  their  study  of 
phase  sensitivity  to  use  a stimulus  in  which  the  amplitude  changes 
were  held  constant. 

Thompson's  (1877)  discovery  that  beats  could  be  detected 
when  Individual  tones  were  presented  separately  to  each  ear  may  be 
possible  justification  for  restricting  the  definition  of  monaural 

S 

phase  effects  to  exclude  amplitude  changes.  Rayleigh  (1907) 
repeated  Thompson's  experiment  and  received  the  same  results.  It  Is 
not  entirely  clear  that  such  restrictions  should  be  placed  on  the 
definition  of  monaural  phase  effects  but  one  should  be  aware  of  the 
possibility  of  different  classes  of  phase  effects  and  different 
mechanisms  to  account  for  them.  The  definition  of  phase  deafnese 
appears  to  be  subject  to  the  investigator's  interpretation. 

In  declining  to  classify  his  "apparent"  exception  to  the 
phase  rule  as  a phase  effect,  Helmholtz  Is  said  to  have  Introduced  a 
semantic  problem.  Koenig  (1881)  (p.  537)  objected  in  that  If  tone 
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quality  does  partly  depend  on  the  relative  strength  of  the  higher 
frequency  components,  and  If  the  relative  strength  ie  modified  by 
phaee,  then  the  effect  of  the  phase  is  "actual,  and  not  merely 
apparent. " 

It  ie  important  to  distinguish  phase  effects  from  phaee 
eeneltlvlty.  Phase  effects  arise  from  arbitrary  manipulations  of 
phase,  e.g.,  setting  the  phase  of  all  frequency  components  to  zero. 
Phaee  sensitivity  on  the  other  hand  is  to  be  understood  to  arise 
only  from  phase  modifications  that  may  be  effected  by  linear, 

f 

stationary  systems.  The  reason  for  the  distinction  ie  that  the 
phaee  modifications  used  in  the  study  are  effected  by  linear, 
stationary  systems. 

The  Validity  of  the  Phase  Rule 

It  is  not  clear  that  any  of  the  studies  to  be  cited  relevant 
to  the  phase  sensitivity  question  have  been  explicitly  designed  to 
"prove"  or  "disprove"  Helmholtz’  phase  rule  or  its  exception. 
Indeed,  to  do  so,  the  study  would  of  necessity  have  to  use  the  same 
type  stimulus  used  by  Helmholtz  in  formulation  of  the  rule.  Some  of 
the  studies  to  be  cited  have  used  simple  sinusoidal  components  as 
the  stimulus,  however,  and  have  accordingly  made  reference  to  the 
validity  of  the  phase  rule  or  its  exception. 

Goldstein  (1967)  noted  that  based  on  Helmholtz*  "definitions 
of  musical  and  nonmusical  sounds,  his  inexplicit  restrictions  on  hie 
phase  rule,  and  his  concept  of  limited  frequency  resolution,  It  Is 
clear  that  Ohm's  lau  (I. a.  the  phase  rule!  as  commonly  conceived  Ie 
valid  only  for  compound  tones  ulth  relatively  large  frequency 
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separations  between  constituent  simple  tones ” 

The  idea  of  "proving"  or  "disproving"  Helmholtz*  phase  rule 
may  not  be  a valid  concept  as  the  question  Is  commonly  approached. 
As  noted  by  Licklider  (1951),  Helmholtz  never  Intended  his  rule  to 
be  so  generally  interpreted.  The  phase  rule  as  defined  by  Helmholtz 
is  almost  certainly  correct.  A cogent  objection  to  too  heavy  a 
reliance  upon  Its  significance,  however,  Is  that  it  pertains  only  to 
a very  small  subset  of  the  day-to-day  signals  encountered  by  the 
average  ear,  and,  inasmuch  as  phase  can  be  detected  as  wi  I I be 
discussed  in  Chapter  III,  the  rule  provides  little  insight  into  the 
operation  of  any  phase  detecting  mechanism  of  the  ear. 

Significance  of  the  Phase  Sensitivity  Question 
The  question  as  to  whether  or  not  the  ear  ie  sensitive  to 
phase  le  so  poorly  formed  that  it  is  of  very  little  value.  The 
I N -condi 1 1 on i ng  of  the  question  rests  entirely  on  the  generality  oi 
the  word  "phase."  A straight  forward  psychophysical  experiment  mail 


be  conducted 

to  reveal 

whether 

or  not  the  ear 

i 8 

able  to 

discriminate 

between  an 

original 

acoustic  signal 

and 

a phase 

modified  version.  Proceeding,  one  may  conduct  a second  study  with 
another  particular  type  of  phase  modification.  It  is,  in  fact,  this 
process  that  is  represented  in  the  review  of  literature  to  be  .cited 
In  Chapter  III.  Given  the  process  above,  one  may  determine  whether 
or  not  the  ear  is  sensitive  to  a particular  type  of  phase 
modification.  The  degrees  of  freedom  in  adjusting  the  phase  In  a 
given  sxpsrlmsnt,  however,  Is  without  bound.  Cited  studies  will 
show  that  the  ear  Is  conclusively  sensitive  to  phase  modi  fleet  lone. 
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Inasmuch  as  phass  is  a general  term,  one  must  conclude  that  if  the 
ear  Is  sensitive  to  ang  tgpe  of  phase  modification  that  It  Is 
sensitive  to  phase.  This  conclusion,  however,  Is  unfortunately  of 
as  little  value  as  the  question  it  proposes  to  aneuer. 

The  author  submits  that,  except  for  its  historical  value, 
the  validity  of  the  phase  rule  is  of  little  interest.  Rather  than 
determine  if  the  phase  rule  is  valid  or  not,  the  proper  approach  to 
accessing  the  phase  sensitivity  of  the  ear  is  to  determine  in  what 
way  and  to  what  extent  the  ear  is  sensitive  to  phase  modification 
within  acoustic  signals  and  to  determine  the  nature  of  the 
sensations  of  the  sensitivity.  This  problem  may  be  approached  as  a 
study  to  determine  the  perceptible  and  tolerable  limits  of  the  ear 
to  phase  distortion  which  Is  synonymous  with  phase  changes.  The 
author  proposes  to  determine  the  general  limits  of  phase  sensitivity 
by  using  impulses  and  speech  sounds  as  stimuli  in  a series  of 
discrimination  tests.  A "general"  type  of  phase  distortion  based  on 
a stylized  analysis  of  phase  distortion  as  discussed  in  Chapter  V 
will  be  used  to  systematically  modify  the  stimuli  such  that  phase 
d i ecr  1 mi  nab  I I I ty  may  be  determined  as  functions  of  the  parameters  of 
the  stylized  distortion. 


CHAPTER  III 


PERTINENT  LITERATURE  CONCERNING  PHASE  SENSITIVITY 


The  purpose  of  this  chapter  is  to  review  relevant  phase 

literature  demonstrating  that  the  ear  is  undoubted  I y sensitive  to 

phase.  Very  early  studies  concerning  the  sensitivity  of  the  ear  to 

phase  have  been  revleued  by  Beasley  (1931). 

Konig,  1881,  conducting  experiments  with  a wave  siren, 
concluded  that  tone  quality  is  changed  with  phase 
displacement  of  a harmonic.  Inasmuch  as  a nave  siren  does 
not  produce  a fidelity  reproduction  of  the  waveform  cut  in 
the  siren  disk,  however,  his  results  are  not  conclusive. 
Hermann,  1894,  also  using  a siren,  concluded  that  phase 
displacements  are  irrelevant  and  criticized  Konig* s work  on 
the  basis  that  the  intensity  variations  resulted  from 
changes  In  the  wave  pattern  on  the  siren  disk.  Gray,  1899, 
using  tuning  forks,  concluded  that  "the  single  ear  can 
distinguish  no  difference  between  two  phases  of  a simple 
pure  tonei  neither  can  the  ear  distinguish  a phase  change 
in  a complete  harmony."  Emile  ter  Kuile,  1902,  using  tuning 
forks  to  produce  tertiary  harmonies  in  which  one  of  the 
forks  was  slightly  mistuned  so  as  to  produce  slow  beats, 
produced  definite  changes  in  quality.  Thie  same  type  of 
stimulus,  i.e.,  one  using  a "floating"  or  continuous  change 
of  phase,  but  produced  by  a tone  Phaser,  was  used  by 
Beasley  in  his  own  experiments.  Lindig,  1903,  using  a 
"telephone  siren,"  concluded  that  tone  quality  was 
influenced  only  by  beats  between  identical  frequencies 
introduced  by  overtones  common  to  different  systems  in 
combination.  Lloyd  and  Agnew,  1909,  using  telephone 
receivers  and  special  alternating  current  generators, 

concluded  that  quality  was  not  affected  by  phase. 

Hartrldge  and  Cosen9,  1922,  concluded  that  "change  of  phase 
affects  the  quality  of  a musical  chord  if  its  constituent 

tones  are  accompanied  by  harmonics)  but  with  pure  tones, 

free  from  harmonics,  change  of  phase  does  not  audibly 
affect  the  quality  of  the  mixed  tone." 

Beasley  also  cited  a number  of  authors  who  theorized  about  phase 

sensitivity  but  did  not  cite  their  experimental  work.  If  any. 
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The  type  of  experiment  performed  to  acceee  the  eeneitivlty 
of  the  ear  to  phase  may  be  broadly  classified  Into  tuo  types 
depending  on  the  intensity  of  the  stimulus.  The  stimulus  may  be  of 
weak  or  moderate  intensity  such  that  the  ear  Is  not  overloaded  to 
produce  aural  nonlinear  distortion,  or  the  stimulus  may  be  of 
sufficient  intensity  to  force  the  ear  into  a nonlinear  mode  of 
operation.  It  must  be  understood  at  the  onset,  however,  that  the 
general  consensus  is  that  phase  effects  are  in  fact  mediated  by  the 
nonlinearities  of  the  ear.  The  ear  is  generally  modeled  as  a linear 
system,  driven  with  lou  amplitude  signals.  The  model  may  be  too 
simple.  That  the  ear  is  nonlinear  is  well  understood;  nonlinearity 
may  arise  apparently  from  any  part  of  the  auditory  system.  Research 
of  the  type  in  which  the  stimulus  intensity  is  certain  to  produce 
nonlinear  distortion  will  be  discussed  first. 

Chapin  and  Firestone  (1934),  in  the  attempt  to  explain 
masking,  difference  tones,  and  certain  kinds  of  beats,  demonstrated 
that  when  a 108  Hertz  signal,  strong  enough  to  produce  nonlinear 
distortion,  and  one  of  Its  harmonics  were  presented  to  a listener  In 
various  relative  phase  conditions  and  intensities,  that  the  tone 
quality  and  loudness  were  influenced  significantly  by  the  phase 
relation  of  the  two  components.  The  authors  concluded  that  the  ear 
dlstorte  the  input  signal  so  as  to  constructively  or  destructively 
combine  distortion  products  with  the  original  signal. 

Lewis  and  Larsen  (1937)  demonstrated  phase  effects  with  a 
difference  tone  of  130  Hertz  generated  by  two  frequencies  of  390  an  i 
S20  Hertz  at  70  dB  SPL.  They  measured  the  Intensity  of  a 130  Hart.: 
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exploring  tone,  presented  In  various  phases,  that  was  necessary  to 
make  the  two  component  signal  just  noticeably  louder.  The  phase 
angle  at  the  minimal  intensity  at  which  each  of  the  two  subjects 
could  detect  a change  varied  for  each  subject  by  120  degrees.  Ulhen 
the  minimum  phase  for  each  subject  was  shifted  by  180  degrees  each 
subject  reported  the  sound  as  being  at  Its  softest. 

Uhen  the  ear  is  forced  into  a nonlinear  region  of  operation 
by  a test  stimulus  of  excessive  intensity,  the  stimulus  quality  will 
vary  with  the  onset,  grouth,  and  phase  relations  of  the  distortion 
products.  Although  work  in  this  area  will  likely  lead  to  further 
understanding  of  the  auditory  system  and  further  revision  of  the 
theories  of  hearing,  it  is  not  clear  that  data  gathered  from  such 
experiments  in  which  the  ear  is  operating  in  an  abnormal  mode  will 
be  directly  applicable  to  discovering  any  basic  phase  sensitivity 
mechanism  of  the  ear. 

Research  of  the  type  involving  lower  level  stimuli  has  both 
supported  and  refuted  the  phase  rule.  The  view  that  tone  quality  ie 
Independent  of  the  phase  relationships  among  the  frequency 
components  of  a sound  had  apparently  become  accepted  by  the  time  of 
Beasley's  work  in  the  1930’s.  Beasley  (1930a)  objected  to  this  view 
in  that  a critical  analysis  of  the  experiments  that  supposedly 
supported  the  assertion  revealed  no  evidence  either  to  support  or 
refute  the  view.  In  a detailed  review  of  the  experimental  work  of 
Helmholtz,  1877;  Konig,  1881;  Hermann,  1894;  ter  Kuile,  1902; 
Llndig,  1903;  Lloyd  and  Agnew,  1909;  and  Hartridge  and  Cosens,  1921; 
Beasley  reported  that  a discriminable  change  in  tone  quality 
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Invariably  occurred  in  any  case  in  which  there  wae  a demonstrable 
phase  shift.  Beasley  reported  that  each  Investigator  reviewed 
suggested  that  the  quality  change  could  be  explained  by  "assum 1 na 
periodic  re  I nforcement  and  interference  between  identical 
frequencies  . . . common  to  the  two  generators  used  as  sources  for 
the  fundamental  si  and  that  the  periodic  variation  of  wave  form  Is  I c3 
consequent  upon  a change  in  the  phase  relations  of  the  fundamentals 
is  irrelevant  for  hearing." 

Beasley  objected  to  the  explanation  above  because  the 
interference  effects  were  deduced  from  mathematical  considerations 
rather  than  objective  demonstration,  because  the  nature  of  the 
variations  were  not  described,  and  because  acceptable  proof  of  phase 
Irrelevance  would  depend  on  producing  the  phase  variations  under 
conditions  conducive  to  observing  quality  variations  with  no 
dl ecr Imlnable  quality  variations. 

Bsasley  reviewed  factors  necessary  to  resolve  the  phase 
Independence  question  discussed  in  an  earlier  paper  (Beasley, 
1930b).  In  brief,  these  factors  include  the  use  of  test  stimuli 
with  maximum  waveform  variation,  due  to  phase  variations  between  two 
fundamentals,  for  a variety  of  frequency  ratios,  which  occur  in  a 
period  of  time  psychologically  favorable  for  observing  the  change. 
Under  these  conditions,  the  effect  of  simultaneously  shifting  the 
phase  between  the  fundamentals  and  harmonics  of  identical  frequency 
and  known  magnitude  should  be  contrasted  with  the  effects  of  holding 
the  phaee  relation  between  the  fundamentals  constant  while  altering 
the  phase  relation  between  harmonics  of  Identical  frequency.  The 
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latter  teat  should  be  done  for  cases  In  which  the  harmonics  are 
separated  by  various  intensity  levels,  and  for  cases  in  which  they 
are  of  subthreshold  magnitude.  The  last  factor  is  to  analyze  the 
effect  of  cyclically  varying  the  phase  between  two  pure 
fundamental e. 

Beasley  determined  that  there  were  detectable  monaural  phase 
effects  with  two  pure  tone  stimuli  with  the  frequency  ratio  2i3 
which  were  made  to  vary  slowly  in  their  phase  relationship.  At  a 
signal  sensation  level  of  10  dB  the  changing  phase  conditions  could 
not  be  discriminated  any  better  than  chance  but  at  a sensation  level 
between  25  and  30  dB  90  per  cent  correct  discriminations  were 
possible.  The  subjects  reported  simultaneous  variations  in  pitch, 
quality,  and  pattern.  Even  though  some  subjects  were  able  to 
selectively  respond  to  particular  changes  they  could  not  agree  on 
what  was  changing,  e.g.,  the  pitch,  loudness,  etc. 

Steinberg  (1930)  conducted  experiments  uith  all-pass  systems 
with  monoton  I ca I I y Increasing  phase  characteristics  to  determine  the 
effect  of  dispersion  on  articulation.  Steinberg  conclu'ed  that  the 
major  consequence  of  delay  distortion  wa9,  in  effect,  to  reduce  the 
band  pass  of  the  transmission  system  by  prevent  ng  the  delayed 
frequency  bands  from  contributing  to  articulation. 

The  experimental  work  of  Schouten  (1939)  is  probably  the 
most  recent  reporting  to  support  Helmholtz*  phase  rule  and  to 
demonstrate  that  the  qualification  to  the  rule  is  unnecessary. 
Schouten  described  an  apparatus  used  in  the  synthesis  of  sound. 
Paper  stencils  representing  single  periods  of  two  waveforms  were 
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placed  between  a light  source  and  a photoelectric  cell.  A rotating 
disk  containing  radial  slits,  placed  between  the  6tencile  and 
photoelectric  cell,  was  used  to  vary  the  light  flux  Incident  to  the 
cell  as  a time  function  of  the  desired  waveforms.  Schouten  ueed 
this  device  to  investigate,  among  other  things,  the  influence  of 
phase  on  sound  perception  and  nonlinear  distortion  in  the  ear.  As  a 
result  of  adjusting  the  relative  phase  of  the  two  stencils  and 
listening  to  the  combined  signal,  Schouten  reported  that  "It  uae 
never  possible  to  discern  any  influence  of  phase  on  eound 
perception"  except  in  the  special  case  when  nonlinear  distortion 
occurred  in  the  ear. 

Schouten  pointed  out  that  the  device  could  not  be 
practically  applied  to  shift  the  phase  of  a large  number  of 
components,  notwithstanding  the  importance  of  determining  the 
validity  of  Helmholtz’  rule  for  waveforms  with  a large  number  of 
components.  Schouten  used  four  20-component  waveforms  with 
different  phases  to  tsat  phase  sensitivity  for  cases  with  many 
components.  Schouten  stated  that  "it  was  found  that  these  four 
totally  different  wave  forms  [si cl  were  quite  indistinguishable  as 
to  their  sound  impression,"  which  he  held  to  confirm  Helmholtz’  rule 
for  the  extreme  case  of  waveforms  with  many  harmonic  components.  As 
a practical  consequence,  Schouten  stated  that  one  may  confine 
himself  to  the  measurement  of  harmonic  Intensities  without  regard  to 
phase  In  the  determination  of  nonlinear  distortion. 
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Mathes  and  Hiller  (1947),  using  modulation  techniques  to 
produce  essential Ig  a three  component  signal,  determined  that  the 
envelope  shape  of  a steadg  complex  signal  was  important  in  audibln 
perception.  The  sensations  of  roughness  or  smoothness  were  found  to 
be  influenced  bg  the  envelope  shape  of  the  waveform.  The  shape  was 
also  found  to  be  related  to  a sensation  of  apparent  pitch. 
Differences  in  sensation  could  be  produced  bg  changing  the  phase  of 
onlg  a single  frequencg  component  or  a group  of  componente.  The 
authors  reported  that  their  results  provide  general  verification  of 
the  qualification  placed  bg  Helmholtz  on  hie  phase  rule.  The 
authors  also  noted  that  their  results  emphasized  the  importance  of 
time  factors  in  auditorg  perception. 

Flanagan  (1950,  1951)  studied  the  effects  of  del  aging  or 
advancing  one  frequencg  band  relative  to  the  rest  of  the  spectrum  on 
speech  articulation  and  quality.  He  determined  that  speech 
intelligibility  was  impaired  with  advances  or  delays  of 
approximately  one-quarter  second  when  the  advanced  or  delayed  band 
was  near  the  center  of  the  spectrum. 

LicKlider  (1957)  determined  that  changes  in  the  phase 
relations  of  a 16-component  complex  signal  were  di scrim inable  and  in 
some  Instances  sufficient  to  be  of  importance  in  music.  It  was 
found  that,  in  general,  changing  a high-frequency  component  produced 
more  effect  than  changing  a low-frequency  component. 

Schroeder  (1959)  also  found  a variety  of  subjective  effects 
as  a result  of  varying  the  phase  relationship  in  a complex  acouetlc 
stimulus.  The  signal  contained  up  to  31  harmonics,  A strong 
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dependence  of  the  quality  on  the  "peak  factor"  was  found  to  ex  let. 
Some  adjustment  of  phases  produced  distinct  tones  which  enabled  one 
to  play  simple  melodies. 

Craig  (1961),  and  Craig  and  Jef frees  (1962)  demonstrated 
monaural  phase  effects  using  stimuli  consisting  of  a 250  Hertz 
fundamental  and  its  harmonic  at  various  intensities  and  phase 
relatione.  These  authors  noted  that  previous  studies  tended  to  use 
complicated  stimuli  which  Increased  the  difficulty  of  finding  a 
physiological  explanation  for  the  phase  effects.  The  authors 
reported  striking  Individual  differences  among  subjects  In  their 
responses.  A stimulus  which  was  consistently  Judged  higher  In 
pitch,  louder,  or  purer  by  one  subject  uas  just  as  consistently 
judged  lower',  softer,  or  less  pure  by  another, 

Schroeder  (1966)  has  pointed  out  that  phase  is  a relatively 
minor  factor  in  monaural ly  presented  speech  signals.  There  is  some 
Influence  on  speech  quality  when  the  sound  is  presented  over 
earphones  but  there  is  probably  no  effect  on  intelligibility. 
Schroeder  also  noted  that  phase  distortions  corresponding  to  delay 
distortions  exceeding  50  milliseconds  will  modify  the  short-time 
spectrum  and  will  be  consequentially  detectable  as  a reverberan 
speech  quality.  Uith  sufficiently  large  delays,  the  speech  can  be 
made  unintelligible. 

Goldstein  (1967)  studied  the  relation  of  monaural  phase 
perception  to  limited  auditory  frequency  resolution.  Using 
modulation  techniques  to  produce  the  stimuli,  the  author  was  able  to 
demonstrate  that  phase  effects  disappsar  for  stimulus  bandwidthe 
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greater  than  a value  proportional  to  the  critical  bandwidth  of  the 
carrier  frequency. 

An  indirect  verification  of  the  sensitivity  of  the  ear  to 
phase  has  been  provided  by  Allen  (1972),  who  has  successfully 
removed  the  character! stlc  buzz  from  synthetic  speech  by  randomizing 
the  phase  of  contiguous  spectral  components. 

Hansen  and  Madsen  (1974)  showed  that  the  ear  was  able  to 
detect  phase  changes  without  amplitude  changes.  The  authors  also 
noted  that  phase  detectability  ua9  increased  when  loudspeakers 
having  poor  transfer  character  i st  i cs  were  used  as  the  st  imulus 
transducer. 

In  summary,  a number  of  studies  have  been  discussed 
pertinent  to  the  phase  sensitivity  of  the  ear  with  the  consensus 
being  that  the  ear  is  capable  of  perceiving  phase  modifications  in 
acoustic  signals. 


CHAPTER  IV 


METHODS  AND  INSTRUMENTATION 

A major  thrust  of  tha  currant  research,  as  Introduced  In 
Chapter  I,  has  been  the  development  and  application  of  digital 
techniques  to  psychophysical  research.  Many  earlier  i nvest i gatore 
of  the  phase  sensitivity  question  had  to  rely  on  mechanical, 
electromechanical,  or  electrical  analog  devices  uith  the  result  that 
only  simple  stimuli  could  be  produced  and  accurately  controlled. 
The  computer  makes  the  study  under  discussion  feasible  because  it 
makes  possible  the  efficient  and  accurate  generation  of  the  complex 
stimuli  called  for  in  the  design  of  the  experiment. 

It  uould  be  difficult,  If  not  impossible,  to  uee  analog 
equipment  to  generate  the  required  stimuli.  Even  If  an  analog 
approach  uere  taken,  the  signals  uould  only  be  approximate. 
Approximations  uould  at  best  introduce  errors  into  the  study.  The 
study  requires  the  facility  of  exactly  specifying  the  phase 
characteristics  of  a signal  and  of  systematically  modifying  them  eo 
ae  to  probe  a complex  hearing  phenomenon. 

The  purpose  of  the  present  chapter  io  not  only  to  diecues 
the  psychophysical  methods  relevant  to  this  particular  study  but  to 
discuss  also  the  general  instrumentation  of  such  studies  Jon  a large 
scale  digital  computer.  This  goal  uill  be  approached  by  first 
discussing  the  theory  of  linear  systems  and  the  extension  of  the 
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theory  to  discrete  systems.  Computer  terminology  and  concepte  must 
of  necessity  be  used  throughout  the  discussion.  Readers  unfamiliar 
with  computers  are  referred  to  Appendix  B. 

The  Analysis  of  Linear  Systems 

The  analysis  of  linear  systems  has  been  discussed  by 
Papoulis  (1962),  Guillemin  (1963),  and  others.  In  review,  a 
physical  system  may  be  analyzed  by  studying  the  relationship  between 
the  input  to  the  system,  f(x),  and  the  output,  g(x).  Thie 
relationship,  illustrated  in  Figure  1,  may  be  expressed 
mathemat ical ly  as 


S(f(x)l  - g(x) , 

where  S le  a transformation  of  f(x)  into  g(x). 


(1) 


Figure  1.  Representation  of  a physical  system. 

The  system  is  completely  characterized  if  the  output  for  any 
input  is  known.  In  general,  however,  determining  the  output  for  any 
conceivable  Input  is  a formidable.  If  not  Impossible,  task  unless 
certain  simplifying  constraints  are  imposed  on  the  system.  A eyetem 
Is  said  to  be  I I near  if 

L(f  (k)  ♦ f (x)]  - L tf  (x) 1 + L Cf  (k)  1 - g (x)  + g (x)  (2) 

12  1 2 12 

and 
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Ltc  fix)}  - c Ltflxi)  - c g (m)  (3) 

for  all  inputs,  f(x),  and  constants,  c.  L symbolizes  the  linearity 
property.  Property  (2)  is  called  superpos i t i on.  which  indicates 
that  L processes  the  additive  inputs  as  if  they  were  processed 
separately  and  then  added.  Property  (3)  is  called  eca I eab i I 1 tu. 
which  indicates  that  the  outputs  are  scaled  in  correspondence  with 
the  Inputs.  The  mathematical  operations  represented  in  (2)  and  (3) 
are  vector  addition  and  scalar  multiplication. 

Another  simplifying  property  is  that  of  etat ionar i tu.  A 
system  is  stationary  if 

SIf(x  - xo)]  ■ g(x  - xo) , (4) 

which  Is  to  say  that  the  output  for  a given  Input  will  be 
Independent  of  a shift  in  the  independent  variable.  The  property  of 
etationarlty  is  more  properly  termed  t i me  i nvar I ance  when  the 
Independent  variable  is  time. 

It  can  be  shown  that  a system  which  has  been  doubly 
constrained  to  be  both  linear  and  stationary  can  be  completely 
specified  by  a single  function,  h(x),  the  i mpu I se  response  of  the 
system.  The  impulse  response  is  the  output  of  a system  when  its 
Input  Is  an  impulse,  d(x). 


L (d(x)  ] - h ( x ) (5) 

As  approximate  examples  of  impulse  responses  one  may  tap  his 
cheek  as  the  articulators  are  set  to  produce  various  vowels.  The 
oral  cavity  forme  a physical  system  which  when  struck  produces  a 
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character! st Ic  pressure  nave  at  the  mouth  corresponding  to  the  given 
vouel.  Striking  one’s  desk  results  in  the  production  of  a sound 
wave  which  approximates  Its  Impulse  response.  These  samples  are 
only  approximate  because  the  systems  are  not  excited  with  true 
Impulses,  which  by  definition  are  Infinitely  sharp. 

The  magnitude  of  the  impulse  response  will  be  proportional 
to  the  strength  of  the  impulse;  the  stronger  the  impulse,  the 
greater  the  response. 

A given  Input  signal  may  be  conceptualized  ae  a sequence  of 
Infinitely  narrow  impulses  whose  individual  power  corresponds  to  the 
amplitude  of  the  waveform  at  the  individual  points  to  uhich  they 
temporally  correspond.  Each  impulse  sequentially  applied  to  the 
Input  of  a phyeical  system  would  cause  the  system  to  respond  by 
generating  an  Impulse  response  scaled  in  accordance  with  the  power 
of  the  impulse.  Inasmuch  as  the  impulse  response  to  a given  impulse 
may  not  have  died  away  before  the  next  impulse  is  applied,  the 
output  will  consist  of  the  algebraic  sum  of  each  sequentially 
produced  and  scaled  response. 

The  process  described  above  is  called  convol ut 1 on.  The 
Input  function  is  said  to  be  convolved  with  the  impulse  response  to 

produce  the  output  and  is  represented  mathematically  as 

00 

g(x)  - J MXlhlx  - Md\  (S) 

00 

and 

oo 

J h (X) f (x  - Aid*. 

on 


g (x) 


(7) 
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The  equations  show  that  the  order  of  convolution  is  immaterial.  It 
should  be  noted  that  the  convolution  integral  applies  only  to 
linear,  stationary  systems. 

Convolution  is  only  one  approach  to  the  analysis  of  linear, 
stationary  systems.  In  convolution,  the  input  is  broken  up  into 
"slices,"  the  output  determined  for  each  slice,  and  the  individual 
outputs  sumed  to  form  g(x).  The  summing  process  is  difficult 
because  ecaled  and  temporally  displaced  copies  of  the  impulse 
response  must  be  algebraically  added.  The  Fourier  Integral  le 
Important  in  the  analysis  of  linear,  stationary  systems  because  of 
the  ease  with  which  the  output  may  be  determined.  As  introduced, 
the  Fourier  transform  converts  a time  domain  representation  of  a 
waveform  into  a frequency  domain  representation.  The  Fourier 
Integral  transform  is  defined  mathematically  as 

00 

f -jwx 

F (w)  - / f(x)e  dx.  (8) 

oo 

F(u)  Is  in  general  complex.  The  units  of  w are  radians  per  unit  of 
x.  Although  the  Fourier  transform  does  not  exist  for  all  functions, 
It  will  for  those  functions  of  interest  to  this  thesis.  The  Inverse 
Fourier  Integral  transform  is  given  by 

oo 

/jwx 

F (w)  e dw.  (9) 

00 

To  appreciate  the  application  of  the  Fourier  integral  to 

system  analysis,  one  must  realize  that  if  the  input  to  a system  is 
jwx 

an  exponential  e , then  the  output  will  also  be  an  exponential. 
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but  scaled  In  proportion  to  the  input. 

j ux  jwx 

Lie  1 - ke  - g(x)  (10) 

The  proport  1 ona I I ty  constant,  k,  is  in  general  complex  and  is 
usually  symbolized  H(w).  The  proportionality  constant  Is 
represented  In  both  polar  and  rectangular  coordinates  in  (11). 

j0  (w) 

H(h)  - A(w)e  - R(w)  + jX(w)  (11) 

H(m)  is  the  Fourier  transform  of  the  impulse  response,  h(x),  and  is 
cal  led  the  austem  function. 

00 

/-jwx 

h(x)e  dx  (12) 

00 

Ulth  F(u)  the  Fourier  transform  of  f(x)  and  G(w)  the  Fourier 
transform  of  g(x)  It  can  be  shown  that 

G (w)  - F (w)  H (w) , (13) 

which  Is  to  say  that  in  the  frequency  domain  the  output  of  a linear 
system  is  the  product  of  the  input  and  the  system  function.  The 

value  of  the  Fourier  transform  in  analysis  can  be  seen  from  (13)  t 

> 

the  Fourier  transform  maps  convolution  into  multiplication.  The 
added  cost  of  such  analysis  is  two  forward  transforms  and  one 
Inverse  transform. 


/ 
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The  Discrete  Fourier  Transform  and 
Discrete  Linear  Systems1 

Computers  are  only  able  to  process  numbers  or  symbols 
because  of  their  inherently  discrete  nature.  Symbolic  processing  is 
of  little  use  in  signal  processing,  however,  because  signals  of 
interest  generally  have  no  known  representative  formuli.  It  Is 
consequently  necessary  to  represent  continuous  signals  as  though 
they  were  discrete.  Pi screte  funct i one  may  be  represented  within 
computer  memory,  inasmuch  as  such  functions  are  defined  only  at 
discrete  values  of  the  independent  variable,  by  simply  storing 
sequential  values  of  the  function  in  contiguous  memory  locations. 
Shannon  has  bridged  the  gap  betueen  the  representation  of  continuous 
function  as  discrete  function  by  showing  that  continuous  functions 
may  be  sampled,  i.e.,  measured  at  discrete  intervals  of  time, 
without  loee  of  information,  provided  that  the  sampllng(  rate  le 
greater  than  twice  the  frequency  of  the  highest  frequency  component 
within  the  function. 

In  addition  to  discretely  representing  signals  of  interest, 
it  is  also  necessary  to  develop  discrete  forms  of  the  Fourier 
transforms  such  that  the  properties  of  the  transforms  may  be 
applicable  to  digital  signal  processing.  The  Fourier  transform  of  a 
discrete  function,  called  the  di screte  Four i er  transform  (DFT)  must 
not  only  be  a discrete  function  itself,  but  also  be  bounded  such 


It  is  difficult  to  exercise  much  originality  In  a concise 
discussion  of  discrete  linear  systems  due  to  ...e  ever  increasing 
number  of  references  on  the  subject.  This  particular  introduction 
to  discrete  systems  follows  partially  from  the  work  of  Cole  (1973). 
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that  It  may  bo  represented  in  the  computer. 

It  is  Instructive  to  note  in  regard  to  the  development  of 
the  DFT  that  It  may  be  shown  that  the  transform  of  a periodic 
waveform  is  discrete  whereas  the  transform  of  an  aperiodic  waveform 
is  continuous.  Inasmuch  39  the  OFT  of  a discrete  function  must  also 
be  discrete.  It  logically  follows  that  the  function  and  Its 
transform  must  also  be  periodic.  Cole  (1973)  has  noted  that  many  of 
the  problems  encountered  in  digital  signal  processing  are  due  to  the 
failure  to  remember  the  periodicity  of  the  DFT  and  its  inverse. 

Rablner  and  Schafer  (19S9),  in  an  approach  to  the 
development  of  the  OFT,  first  developed  the  so  called  Z-transf orm  of 
a discrete,  aperiodic  function.  By  the  properties  Just  introduced, 
the  Z-transform  must  be  both  periodic  and  continuous.  The 

Z-traneform  is  sampled  to  produce  a periodic,  discrete  function, 
which  is  taken  to  be  the  DFT. 

Let  f be  a discrete  periodic  function  of  period  N,  The  DFT 
J 

of  f is  another  discrete  periodic  function,  F , also  with  period  N. 

J K 


N-l 

- j (2n/N) JK 

f e 

J 
J-0 


(14) 


-J(2n/N) 

The  complex  expression  e Is  the  principal  Nth  root  of  unity 

and  is  often  abbreviated  as  U. 


F 

K 


N-l 

-JK 

f U 
J 
J»0 


(IB) 
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The  inverse  DFT  is  similarly  defined. 


< f 


f 

J 


N-l 

1,N2Fk 

K-0 


JK 

U 


(1G) 


Signal  processing  became  practical  in  19G5  with  the 
development  of  the  Fast  Four i er  transform  (FFT)  by  Cooley  and  Tukey. 
The  FFT  is  a very  efficient  algorithm  for  computing  the  DFT  and  ae 
euch  does  not  represent  a different  type  of  transform.  The 
computation  time  required  to  compute  the  DFT  of  an  N sample  function 
Is  proportional  to  N squared.  The  FFT  is  able  to  perform  the  same 
transform  by  eliminating  redundant  calculations  through  matrix 
factorization  in  a time  proportional  to  N log  N,  a speed  up  of  N / 
log  N. 

A discrete  I i near  sustem  is  one  which  has  discrete  functions 
as  Its  Input,  output,  and  impulse  response.  Discrete  linear  systems 
also  have  the  properties  of  superposition  and  sea  I eab i I i ty.  If  the 
system  is  also  stationary,  Its  input  and  output  ul  I I be  related  by 
discrete  aper iodic  convo I ut i on.  which  is  defined  mathematically  ae 


g • ^ f h . (17) 

J ' K J-K 

K»-o» 

A second  kind  of  discrete  convolution  Is  per  iodic,  or 
c I rcu I ar  convo I ut i on.  The  circular  convolution  of  two  periodic 
discrete  functions,  f and  h,  is  a third  periodic  function,  g. 


/ 


expressed  mathematically  as 
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N-l 

g - f h . (18) 

J K J-K 

K-0 

It  should  be  noted  that  f,  h,  and  g all  have  the  same  period,  N. 

The  circular  convolution  is  related  to  the  DFT  in  that  It 
can  be  shown  that  if  F,  H,  and  G are  the  respective  discrete  Fourier 
transforms  of  f,  h,  and  g that 


G - F H . (19) 

K K K 


Host  real  world  signals  are  generally  aperiodic. 
Consequently,  in  order  to  U9e  the  DFT  for  practical  eignal 
processing,  it  is  necessary  to  implement  an  aperiodic  convolution 
with  a periodic  convolution.  Stockham  (19G6,  19G9)  has  discussed 
such  a method.  The  process  basical ly  involves  appending  enough 
zeros  to  f and  h so  as  to  give  them  the  same  period  and  to  ensure 
that  the  resulting  period  is  at  least  greater  than  one  Ies9  than  the 
sum  of  the  original  number  of  samples  in  f and  h.  Since  the  length 
of  the  aperiodic  convolution  f and  h is  one  less  than  the  sum  of 
the  number  of  samples  in  each,  the  results  of  the  periodic 
convolution  Mill  be  periodic  but  Mill  represent  the  aperiodic 


convolution  of  f and  h. 
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Stimulus  Recording,  Storage,  and  Playback 
Digital  techniques  were  used  to  record  the  speech  sounds 
used  in  this  study.  The  sounds  were  spoken  by  a male  speaker,  33 


years  of 

age,  who 

was  judged 

by 

the 

author  to 

have  good 

vo  i ce 

qua  1 1 ty. 

into  a 

one- i nch 

B 

and  K 

condenser 

microphone. 

The 

electrical  signal 

was  amp  1 i f i ed. 

low-pass  filtered 

at  4 kHz, 

and 

samp  1 ed 

at  10,000  samples 

per 

second 

wi th  14-bi  t 

reso 1 ut i on. 

The 

samples  were  stored  packed  two  samples  per  word  on  disk. 

The  disk,  as  a mass  storage  device,  only  allows  data  to  be 
addressed,  i.e.,  referenced,  in  blocks  of  128  words.  The  processed 
signals  used  as  stimuli  in  the  study,  however,  are  not  necessarily 
integral  number  of  blocks  in  duration.  Development  of  a disk  aud i o 
file  sustem  incidental  to  the  study  was  consequently  not  only 
expedient  but  necessary  because  of  the  required  control  of  signal 
oneet  and  duration,  and  because  of  the  need  to  reference  quickly  a 
large  number  of  signals  throughout  the  study. 

The  audio  file  system  facilitated  the  storage  and  retrieval 
of  audio  data  on  disk  by  automating  the  bookkeeping  associated  with 
storage  and  retrieval  such  as  the  location  and  duration  of  the  data 
on  disk,  the  sampling  frequency,  etc.  The  system  was  implemented  by 
storing  a directory  image  on  a reserved  portion  of  the  disk.  The 
directory  image  contains  slot6  for  1023  file  headers,  which  in  turn, 
contain  slots  for  the  sampling  frequency,  beginning  block  location, 
file  length,  and  date  of  file  creation.  A particular  file  is 
referenced  by  a number  between  1 and  1023,  which  corresponds  to  the 
file  header  slot  in  the  directory  image.  Rather  than  request  X 
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words  out  of  Y blocks  beginning  at  block  Z,  the  file  eyetem  allows 
one  simply  to  request  the  data  by  specifying  the  file  number.  The 
eyetem  automatically  looks  up  "free  space"  on  disk  whenever  a neu 
file  le  created  so  as  to  allow  one  to  store  data  simply  by 
specifying  the  file  number  under  which  the  data  le  to  be  stored. 

The  following  example  Is  given  to  demonstrate  the  file 
eyetem  In  use  and  to  discuss  signal  playback.  Suppose  one  wished  to 
replay  signal  X.  The  signal  would  be  specified  in  the  user’s 
program.  The  file  system  would  read  the  directory  image  from  disk 
and  look  up  the  associated  bookkeeping  data  from  the  appropriate 
elot  in  the  directory  image.  Enough  blocks  of  audio  data  would  then 
be  read  from  disk  into  contiguous  memory  locations  to  include  the 
entire  audio  signal.  The  clock  in  the  digi tal-to-analog  converter 
would  be  set  from  other  data  in  the  file  header.  The  computer  would 
then  be  Instructed  to  output  the  correct  number  of  samples  from  the 
word  addressable  memory  to  the  d i gl ta I -to-ana I og  converter. 

The  signals  from  the  dl g i ta I -to-ana I og  converter  are 
low-pass  filtered  at  A kHz  and  amplified  for  presentation  through 
high  quality  headphones. 

Filters  and  Stimulus  Preparation 

Filters  are  linear  systems  In  which  the  amplitude  Alw)  and  the 
phase  0(w)  are  both  functions  of  frequency.  The  system  function  of 
a filter  may  be  represented  mathematically  ae 

-J0(w) 

- A(w)e 


H (w) 


(20) 
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where  9(w)  )e  Known  as  the  Phase  shift  or  phase  I aa  funct  i on.  and  ie 

defined  as  -0(w)  where  0(w)  is  the  phase  of  the  system. 

The  stimuli  peculiar  to  this  study  are  prepared  by 
processing  standard  stimuli  with  a filter  whose  phase  characteristic 
Is  that  of  the  desired  phase  modification.  It  ie  desired  that  oniy 

the  phase  be  modified  so  the  amplitude  characteristic  of  the  filter 

Ie  set  to  unity.  Such  filters  are  called  al I -pass  f i I ters  and  may 
be  represented  as  illustrated  in  Figure  2. 

A (w)  j0(w) 

it  t ^ 


Amplitude  Phase 

Figure  2.  Amplitude  and  phase  characteristics  of  a 
generalized  all-pass  system. 


A systematic  investigation  of  phase  sensitivity  depends  on  a 
systematic  variation  of  the  phase  function.  The  phase  modification 
of  choice  in  this  study  is  to  systematically  delay  various  frequency 
bands  within  the  stimuli.  The  parameters  of  the  phase  modification 
Include  center  frequency  (FC),  banduidth  (BU) , and  time  delay  (TD). 
The  idealized  delay  characteristic  of  the  system  used  to  achieve  the 
phase  modification  is  illustrated  In  Figure  3. 

It  may  not  Immediately  be  evident  that  the  type  of  phase 
modification  shown  In  Figure  3 can  generally  represent  phaea 
distortion.  It  will  be  shown  In  Chapter  V,  however,  that  any  phase 
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distortion  Mill  cause  frequency  component  delay.  The  systematic 
variation  of  a phase  function  Incidental  to  Investigating  phase 
sensitivity  may  be  more  easily  conceptualized  in  reference  to  the 
parameters  of  the  delay.  The  idealized  parametere  of  center 
frequency,  bandwidth,  and  time  delay  may  be  utilized  for  systematic 
modi fi cat  ion. 


Time  Delay 
(mSec) 


T 


h-  buh 


i 


FC 


Frequency  (Radi ans/Sec) 


Figure  3.  Delay  characteristic  of  the  system  used  In 
phase  modification  of  signals  to  be  investigated,  showing 
center  frequency  (FC) , bandwidth  (BU),  and  time  delay  (TD) 
as  parameters. 


Usually  in  the  study  of  phase,  a discussion  of  the  delay 
property  follows  from  the  analysis  of  phase  distortion  as  will  be 
done  In  Chapter  V.  It  is  more  convenient  to  the  development  of  the 
present  discussion,  houever,  to  introduce  delay  before  formally 
discussing  phase  distortion. 

The  stimuli  used  in  this  study  were  prepared  by  convolving, 
l.e.,  filtering,  selected  source  stimuli  ulth  appropriate  filters 
designed  to  introduce  the  required  distortion.  A digitally 
implemented  high-speed  convolution  algorithm  (Stockham,  1SG9)  wae 
used  which  was  based  on  multiplication  of  the  DFT’e  of  the  stimuli 
and  filter  as  shown  by  (13). 


Frequency  (Radians/Sec) 

Figure  4.  Idealized  phaee  characteristic  of  the 
system  to  be  investigated. 


Fi l ter  Design 

The  all-pass,  phase  modifying  filters  used  In  this  study 
wore  designed  by  the  so  called  F i ve  T*  s method.  The  filters  are  of 
the  non-recursi ve  type  which  is  to  say  that  they  are  digitally 
Implemented  to  function  In  the  frequency  domain.  Using  the  Five  T'e 
method,  a desired  phase  characteristic  is  first  specified  in  the 
frequency  domain.  It  should  be  remembered  that  the  specified  phase 
character i et 1 c is  discrete  but  it  is  convenient  to  represent  the 
characteristic  graphically  as  if  it  were  continuous,  as  illustrated 
In  Figure  5.  The  realized  phaee  is  continuous. 
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Figure  5.  Frequency  domain  representation  of  the 
idealized  all -pass  system  to  be  designed. 


47 


It  is  natural  to  specify  the  desired  phase  character i et  I c in 
polar  coordinates  inasmuch  as  it  is  in  polar  coordinates  that  the 


effects  of 

the  characteristic  are 

conceptualized.  The 

□FT 

algor  1 thm. 

houever,  requires  that 

the  data 

be 

represented 

1 n 

rectangular 

coordinates  so  the  next 

step  is 

to 

convert  to 

the 

rectangular  coordinate  system.  The  data  is  next  transformed  (Tl)  to 
the  time  domain  by  the  inverse  OFT  to  produce  the  impulse  response 
of  the  filter.  The  time  domain  representation  is  guaranteed  to  be 
real  If  the  magnitude  Is  specified  to  be  even  and  the  phase  to  be 
odd  as  Illustrated  in  Figure  5. 

Another  peculiarity  of  digital  signal  processing  is  that  the 
Impulse  response  of  the  desired  filter  may  be  too  long  to  be 
contained  In  the  computer's  memory.  Inasmuch  as  the  memory  size  of 
the  computer  Is  finite,  the  impulse  response  must  be  truncated  (T2). 
To  reduce  the  traumatic  effects  of  an  abrupt  truncation,  the  impulse 
response  is  Hindoued.  i.e.,  taylored  (T3) , so  as  to  turn  it  on  and 
off  smoothly.  The  modified  impulse  response  is  saved  tentatively  on 
disk  for  possible  later  use  in  signal  processing  the  varioue  teet 
stimuli.  The  Impulse  response  is  transformed  (T4)  back  to  the 
frequency  domain  by  the  inverse  discrete  Fourier  transform, 
converted  to  polar  coordinates,  and  then  compared,  i.e.,  tested 
(TS),  against  the  originally  specified  characteristic.  The  impulee 
reeponse  Is  retained  for  later  use  in  processing  the  test  stimuli  If 
the  character  I st ic  of  the  approximated  filter  is  close  enough  to  the 
epee  I fled  ideal  filter. 
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Digital  and  Analog  Equipment 

All  digital  and  analog  equipment  required  for  the  etudy  was 
available  to  the  author  through  the  Sensory  Information  Processing 
Group  (SIPG),  ulth  which  the  author  ie  associated  at  the  University 
of  Utah.  The  main  computer  facility  Includes  a single-user  PDP-10 
computer  with  64  K of  36-bit  words,  a POP-10  time-sharing  computer 
ulth  196  K of  words  for  program  preparation  and  debugging,  and  all 
associated  computer  peripherals.  All  audio  equipment,  including 
filters,  amplifiers,  tape  recorders,  etc.,  Is  of  the  highest 
professional  quality.  Included  In  the  SIPG  facilities  Is  a large 
sound  isolated  "quiet  room"  in  which  test  signals  were  recorded  and 
the  psychophysical  listening  tests  were  conducted. 

Manipulation  of  the  audio  signals  was  facilitated  by  use  of 
a general  purpose  audio  console  designed  and  fabricated  by  the  SIPG 
staff  engineer  (Uarnock,  1973).  The  audio  system  consists  of  a 
modular  set  of  high  quality  amplifiers,  filters,  attenuators,  and 
signal  generators  that  may  be  plugged  into  the  console  cabinet.  The 
cabinet  contains  a common  powsr  supply.  Signal  routing  is  handled 
through  BNC  connectors  on  ths  front  of  Bach  module.  Various 
grounding  configurations  among  the  modules,  cabinet  chassis,  and 
earth-ground  are  provided  by  means  of  switches  so  as  to  minimize  hum 
and  noise  within  the  system. 

The  variable  gain  amplifier  modules  allow  continuous  gain 
adjustments  from  -40  to  +20  dB  over  a frequency  range  from  DC  to  50 
kHz. 
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The  selectable  low-pass  filter  module  allows  switch 
eelection  of  one  of  four  TT  Electronics  (Model  J77AI  low-pass 
filters  with  3 dB  cutoff  frequencies  of  4 kHz,  7 kHz,  10  kHz,  and  15 
kHz.  The  attenuator  module  is  patterned  after  the  HP  model  3500  and 
allows  attenuation  up  to  110  dB  in  1 dB  steps.  The 
digl tal-to-analog  converter  has  IB-bit  resolution  with  an  output 
voltage  range  of  20  Volts.  The  output  of  the  converter  is 
attenuated  17  dB  to  avoid  overloading  the  low-pass  filter,  filtered 
at  4000  Hertz,  and  then  amplified  to  drive  the  headphones.  The 
low-pass  filter  introduces  a loss  of  G dB.  No  calibration  data  was 
available  for  the  Koss  PR0-4A  headphones  used  throughout  the  study. 

The  signal-to-noi se  ratio  of  the  amplifiers  and  filters  is 
greater  than  100  dB.  The  total  harmonic  distortion  of  the 
amplifiers  is  less  than  8G  dB.  The  arrangement  of  the  equipment  is 
illustrated  in  Figure  6. 


Figure  6.  Equipment  arrangement  for  di ecr i mi nat ion  testing  of 
dispersion. 
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Psychophysical  Methods 

Psychophysical  testing  of  phase  ie  difficult  because  of  the 
nature  of  the  required  discrimination  task,  which  ie  the  detection 
of  changes  in  sound  quality.  Various  factors  influence  the 
cognitive  process  of  assigning  a metric  to  the  quality  of  an 
experience,  some  of  which  may  be  quite  independent  of  the  presence 
or  intensity  of  the  various  physical  dimensions  of  the  event  under 
cons i derat  Ion. 

Listeners  generally  have  little  difficulty  in  assigning 
values  to  single  dimensional  events  euch  as  to  sound  frequency  or 
Intensity.  The  dl ecr Imlnabl 1 1 ty  of  sound  quality,  however.  Is  more 
difficult  because  of  the  multi-dimensionality  of  the  factors  that 
Influence  the  psychological  evaluation  of  the  event.  The  detection 
problem  may  be  further  complicated  by  the  possibility  that  different 
aural  mechanisms  account  for  different  aspects  of  phase  sensation. 

The  psychophysical  method  used  in  determining  the 
sensitivity  of  the  ear  in  this  study  was  that  of  constant  st i mu  I I In 
which  a direct  comparison  was  required  between  a standard  stimulus 
and  a number  of  phase  modified  variants  of  the  stimulus. 
Notwithstanding  the  difficulty  of  establishing  a criterion  for  phase 
changes,  listeners  were  Instructed  to  make  phase  change 
discriminations  in  the  series  of  experiments  to  be  discussed  by 
responding  to  "any  difference  in  the  way  two  stimuli  Bounded"  when 
presented  together  as  a stimulus  pair. 

A signal  comparison  testing  program  was  written  by  the 
author  that  randomly  presents  a number  of  audio  file  palrlnge  to  a 
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listener  for  d) scr iminat ion.  The  audio  file  numbers  of  the  files  to 
be  paired  are  specified  in  a command  list  90  as  to  make  the  testing 
program  as  general  as  possible.  For  example,  If  audio  file  103  and 
413  are  to  be  compared  with  file  87,  the  command  list  Mould  contain 
the  fol lowing  datai 

87,  103 
87,  413 

s s s 

A random-like  number  generator  is  used  uithin  the  testing 
program  to  randomize  the  command  list  and  consequently  the  order  of 
the  signal  pair  presentation.  The  number  of  times  the  testing 
program  randomizes  and  executes  the  command  list  may  be  set  by  a 
program  parameter.  The  temporal  spacing  between  the  signals  of  the 
pair  Is  also  adjustable. 

The  signal  pairs  were  presented  to  each  listener  during  the 
course  of  a test  through  high  quality  headphones  at  "average 
conversational  speech  levels."  Ulth  each  presentation  of  a signal 
pair,  the  lletener  was  required  to  respond  either  by  depressing  a 
"same"  or  "different"  key  on  the  computer  terminal  depending  on 
whether  or  not  the  signals  sounded  the  same  or  different.  The 
testing  program  records  the  listeners  responses  and  provides  a 
statistical  rsduction  of  the  data  at  the  conclusion  of  the  test. 


CHAPTER  V 


THE  ANALYSIS  OF  PHASE  DISTORTION 

It  will  be  helpful  to  revieu  the  relation  betueen  phase 
characteristics  and  phase  distortion  as  obtained  by  analytical 
methods  before  discussing  experimental  results.  Phase  mod  I f i cat  i one 
are  secured  by  processing  a signal,  f(t),  ulth  a phase  modifying 
system.  The  absolute  phase  character i st ic  of  fit)  is  of  no  interest 
except  to  the  extent  that  it  joins  in  the  definition  of  fit).  The 
phase  character i et I c of  Interest  Is  that  of  the  phase  modifying 
system  which  Is  represented  In  the  modified  signal,  git),  as  the 
phase  difference  between  git)  and  fit).  It  is  consequently 

convenient  to  discuss  the  phase  characteristic  of  the  modifying 
system,  HIw),  as  the  phase  character'!  st  ic  of  interest. 

Characteristics  of  a Distortionless  System 
Following  the  development  of  Papoulis  (1962),  a 

distortionless  filter  is  one  whose  output,  git),  to  an  arbitrary 
input,  fit),  has  the  same  form  as  the  input.  Noting  that  scaling  or 
delaying  a signal  does  not  change  its  form,  the  input-output 
relationship  of  a distortionless  filter  may  be  written 

git)  - Aflt  - to).  (21) 

The  system  may  be  analyzed  using  the  Fourier  integral  transform. 

- jwto 

Ulth  C(w)  the  transform  of  git)  and  e F (w)  the  transform  of 


53 


f(t  - to),  then 


- jwto 

G (u)  - Ae  F (w) . (22) 

Fol lowing  from  (13),  the  eystem  function  of  a dl  etort lonleee  system 
ie  given  by 

-jwto 

H(u)  - Ae  , (23) 

from  which  the  amplitude  ie  seen  to  be  constant  and  the  phaee 
I inear. 

A(w)  - A 0(w)  - -jwto  (24) 

The  eyetem  ie  eaid  to  be  amp  I i tude  di  9tor  ted  if  A(w)  ie  not  conetant 
and  Phaee  distorted  if  0(w)  ie  not  linear. 

It  may  be  observed  from  (24)  that  the  time,  t,  by  which  the 
eyetem  delays  the  input  ie  given  by 

to  - 0(w)/w  - d/du  0(w) . (25) 

which  Is  to  say  that  distortionless  systems  act  only  to  delay  the 
Input  signal  by  an  amount  equivalent  to  the  elope  of  the  phase 
character)  st  ic.  If  0(w)  is  in  radians  and  w - 2 f,  where  f ie 
frequency  in  Hertz,  the  delay  will  be  in  seconds. 

Interpretat ion  of  Phase  Distortion 
Phase  distortion  occurs  uhen  a system  falls  to  maintain  or 
shifts  the  phase  relations  among  the  frequency  components  of  an 
applied  signal.  Consider  first  the  effect  of  adding  a conetant 
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phase,  0,  to  the  phase  characteristic.  Letting  A(w)  >1,  a 
distortionless  filter  Is  represented  by 

- Jwto 

H(ui)  - e (26) 

as  developed  In  the  previous  section.  Adding  the  constant  phase,  0, 
gives 

a j (0-wto)  j0  -jwto  j0 

H(w)  ■ e - e e - e H(w),  (27) 

which  Is  to  say  that  shifting  the  phase  characteristic  by  a constant 
phase  Is  equivalent  to  multiplying  the  system  function  by  a complex 
constant.  Lane  (1930)  and  Steinberg  (1930)  have  carried  the 
Interpretation  further.  Since 

J0 

e - cos  0 + j sin  0,  (28) 

and  noting  that  0(u)  is  an  odd  function,  the  phase  shifted  output 
may  be  considered  to  be  the  sum  of  two  parts.  The  first  part  is  a 
fidelity  copy  of  the  input  signal,  f(t),  scaled  in  amplitude  by 
cos  0 and  delayed  by  to.  The  second  part  results  from  shifting  all 
the  components  of  f(t)  by  IT/ 2,  scaling  by  sin  0,  and  then  delaying 
by  to. 

Two  special  cases  should  be  noted.  If  0 is  0 or  an  even 

J0 

multiple  of  rr  then  e >1,  which  Is  equivalent  to  a distortionless 

J0 

system.  If  0 Is  an  odd  multiple  of  7T  then  e - -1,  which  le 
equivalent  to  a distortionless  system  with  reversed  output.  In 
either  special  case  there  is  no  distortion,  only  a delay. 
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In  conclusion,  the  elope  and  y-intercept  components  of  a 
linear  phase  function  correspond  respectively  to  a distortionless 
delay  and  a distortion  producing  constant  phase  addition  resulting 
from  the  constant  phase  addition.  It  is  convenient  to  conceptualize 
the  two  components  as  causing  sequential  operations  In  linear 
systems,  l.e.,  a delay  followed  by  a distortion. 

Distortion  Following  from  Nonlinear 
Phase  Character i st i cs 

Steinberg  (1930)  analyzed  the  effect  of  phase  distortion 
resulting  from  curved  phase  character i et I cs  by  considering  euch 
characteristics  to  be  limiting  cases  of  character i st i cs  made  up  of  a 
number  of  straight  lines.  Each  line  approximates  the  curved 

character i st i c for  a given  frequency  range,  aw.  The  frequency 
components  in  each  range  will  be  subjected  to  a delay  and  distortion 
as  previously  discussed  and  illustrated  in  Figure  7. 


Figure  7,  Curved  phase  character i et ic  segment  approximated  by  a 
straight  line.  The  elope  of  the  line  represents  the  temporal  delay 
of  the  frequency  band  and  the  Y-intercept  the  constant  delay. 
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It  should  be  apparent  when  considering  the  affect  of  a 
curved  phase  characteristic  on  the  whole  Input  signal  that  each  band 
Is  delayed  differently  than  its  adjacent  bands  so  ae  to  spread  out 
the  signal  on  the  time  scale.  The  signal  is  said  to  be  d I spersad. 
Each  band  of  frequencies,  £\w,  is  delayed  relative  to  the  minimum 
slope  of  the  phase  characteristic.  The  relative  delay  has  been 
defined  by  Steinberg  as  dalau  distortion. 


CHAPTER  VI 


SENSITIVITY  TO  DISPERSION  IN  IMPULSES 

The  purpose  of  the  experiments  described  and  discussed  in 
this  chapter  was  to  determine  the  di scriminabi I i ty  between  an 
Impulse  and  Impulses  in  which  a band  of  frequency  components  are 
delayed.  An  Impulse  is  an  Infinitesimally  short  signal  in  which  all 
frequency  components  are  present  at  equal  amplitudes.  Impulses  were 
chosen  as  interesting  stimuli  because  they  appear  to  be  among  the 
simplest  stimuli  possible  Inasmuch  as  their  spectra  are  of  constant 
amplitude.  It  is  not  unreasonable  to  assume  that  the  psychophysical 
data  derived  from  teste  using  more  simple  stimuli  may  be  more  easily 
analyzed  than  data  derived  from  those  tests  using  more  complex 
stimuli  such  as  speech.  Such  data  may  lead  to  more  direct  insights 
Into  the  nature  of  phase  diecriminabi  I i ty.  The  insights  derived 
using  simple  stimuli  may  aid  in  the  analysis  of  data  derived  from 
tests  utilizing  the  more  complex  stimuli. 

Sensitivity  of  Three  Subjects  to 
Dispersion  In  Impulses 

The  Intent  of  ths  first  experiment  was  to  ascertain  the 
di  scriminabi I i ty  of  three  subjects  between  an  impulse  and  impulses 
in  which  a band  of  frequency  components  are  delayed.  The  manner  in 
which  a given  impulse  used  as  a stimulus  in  the  experiment  was 
modified  has  been  discussed  In  Chapter  IV.  In  brief,  a standard 
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Impulse  is  processed  so  as  to  delay  a band  of  its  constituent 
frequency  components,  the  independent  variables  of  interest  being 
center  frequency,  banduidth  and  temporal  extent  of  the  delay.  The 
center  frequency  In  the  present  experiment,  houever,  uas  held 
conetant  at  500  Hertz.  Four  values  of  banduidth  and  four  of  delay 
uere  selected  so  as  to  test  discriminabi I i ty  as  a function  of  16 
unique  combinations  of  banduidth  and  delay.  The  banduidths  included 
100,  200,  400  and  800  Hertz.  The  delays  Included  0.0625,  0.125, 
0.25  and  0.5  milliseconds.  Sixteen  phase  modified  impulses  uere 
prepared  according  to  the  prescribed  schedule  as  stimuli  for  the 
experiment. 

The  experiment  consisted  of  administering  16  individual 
tests  to  each  of  the  three  subjects  so  as  to  obtain  a phase 
dl ecr iminabi 1 1 ty  estimate  for  each  of  the  scheduled  phaee 
conditions.  Each  test  consisted  of  a sequential  presentation  of 
stimulus  pairs.  The  subjects  uere  required  to  make  a forced  choice 
decision  as  to  uhether  or  not  the  stimuli  of  a given  test  pair  uere 
the  "same1'  or  "different."  Either  stimulus  of  the  pair  could  be  the 
etandard  impulse  (SI)  or  a phase  modified  variant  of  the  impulse 
(S2)  so  as  to  generate  four  unique  stimulus  pair  types,  i.e.,  Sl-Sl, 
S1-S2,  S2-S1  and  S2-S2.  The  probability  of  occurrence  of  each  type 
uas  equal  so  as  to  represent  a rectangular  distribution.  This 
scheme  uae  adopted  to  average  out  any  experimental  bias. 
One-hundred  stimulus  pairs  uere  presented  in  each  test. 

Three  female  speech  pathology  students  uith  normal  hearing 
and  of  21  to  23  years  of  age  served  as  subjects.  The  stimuli  uere 
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presented  monaural ly  to  the  subject's  left  ear  at  a "low"  sensation 
level  of  approximately  30  to  40  dB.  The  intra-stimulus  duration 
between  stimuli  of  the  pair  was  approximately  700  milliseconds.  The 
Inter-stimulus  duration  was  500  milliseconds,  l.e.,  500  milliseconds 
elapsed  after  the  subject's  response  and  the  automatic  onset  of  the 
next  stimulus  pair. 

The  subjects  were  sequentially  tested  at  the  same  phase 
condition  for  each  of  the  IS  tests.  The  test  order  was  randomized. 
This  approach  was  taksn  becauss  it  was  not  initially  certain  that 
the  range  of  values  sslected  for  the  independent  variables  would  be 
of  interest  for  all  subjects.  The  approach  allowed  modification  of 
the  variable  values  without  detriment  to  the  experiment.  The  loss 
sustained  by  the  experiment  because  of  this  approach,  however,  was 
the  loss  of  absolutely  identical  testing  conditions  for  each  of  the 
tests  inasmuch  as  the  formal  testing  period  required  three  testing 
sessions  over  as  many  days.  The  subjects  received  three  training 
session  prior  to  formal  testing. 

Tr<e  test  results  for  each  of  the  three  subjects  are 
tabulated  In  Table  2 (Appendix  C)  and  are  illustrated  in  Figures  8 
and  9.  The  results  of  the  study  are  Interesting  but  not  overly 
surprising.  The  data  when  plotted  as  a function  of  delay  as  shown 
In  Figure  8 Indicate  a definite  dependence  of  diecrlmlnabi  1 1 ty  on 
delay  for  a given  bandwidth.  It  is  not  clear  from  a consideration 
of  the  data  plotted  as  a function  of  bandwidth  as  shown  in  Figure  9 
that  there  is  a significant  increase  in  di  scriminabl  I i ty  with 
bandwidth  for  a given  delay. 
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Figure  8.  Oi  ecr  iminabi  I i ty  of  subjects  AK,  MF  and  PA  between  an 
impulse  and  phase  modified  impulses  plotted  as  a function  of  delay 
with  bandwidth  as  parameter.  Center  frequency  of  the  delayed  band 
ie  500  Hertz. 
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The  only  similarity  among  tho  subjects  is  an  Increase  In 
dl  scr  Imlnabl  1 1 ty  with  banduldth  at  a delay  of  0.25  milliseconds. 
The  curves  appear  relatively  flat  otherwise  except  for  a dip  in  the 

0. 5  millisecond  curve  for  subject  HF  and  PA  at  different  banduidths, 
and  a peak  in  the  0.125  curve  for  subject  (IF. 

A convenient  statistic  for  indicating  the  ability  to 
discriminate  is  half  way  batueen  chance  and  perfect  performance. 

1. e..  the  0.75  discrimination  score.  The  subjecte  are  able,  in 
general,  to  respond  to  delays  in  impulses  of  between  0.125  and  0.5 
milliseconds  which  are,  apparently,  relatively  independent  of 
bandwidths  greater  than  100  Hertz. 

Sensitivity  to  Dispersion  in  Impulses 
as  a Function  of  Intensity 

The  test  stimuli  used  in  the  experiment  just  discussed  were 
presented  at  a low  level  to  prevent  possible  overloading  and 
subsequent  distortion  in  the  headphones.  It  Is  not  impossible  that 
the  stimulus  level  was  so  low  that  the  stimuli  could  not  be  heard 
adequately  to  provide  valid  discriminations. 

A second  experiment  was  conducted  to  test  the  effect  of 
intensity  on  dlscriminabl I i ty.  A standard  impulse  was  paired  uith  a 
phase  modified  impulse  of  500  Hertz  center  frequency,  400  Hertz 
bandwidth  and  0.25  millisecond  delay  at  three  Intensity  levels.  The 
intensity  levels  were  6 dB  below  the  level  used  In  the  last 
experiment,  at  the  level  used  in  the  last  experiment,  and  6 dB  above 
the  level  used  in  the  last  experiment.  The  testing  design  and 
conditions  uare  otherwise  identical  to  the  experiment  Juet 
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discussed.  The  test  results  are  tabulated  in  Table  3 (Appendix  C) 
and  are  Illustrated  In  Figure'lO.  The  discrimination  scores  mag  be 
observed  to  be  definitely  related  to'  intensity. 


RELATIVE  INTENSITY  (OB) 

Figure  10.  01  scriminabi  1 1 ty  of  subjects  AK,  MF  and  PA  between  an 
Impulse  and  phase  modified  impulses  plotted  as  a function  of 
intensity.  The  parameters  are  500  Hertz  center  frequency,  400  Hertz 
bandwidth  and  0.25  milliseconds  delay. 

To  test  di scriminabi I i ty  at  higher  sensation  levels  and  to 
obtain  a more  complete  set  of  data  in  which  di  scriminabi  I i ty  ie 
determined  as  a function  of  intensity,  the  author  repeated  the 
experiment  serving  as  his  own  subject.  Sixty  stimulus  paire  were 
used  In  each  test.  The  intra-stimulus  Interval  wae  set  to 
approximately  800  milliseconds  and  the  inter-stimulus  interval  to 
600  ml  1 1 1 ssconds.  The  stimuli  were  presented  monaura  fly  to  the 
right  ear.  The  audio  equipment  was  set  to  provide  approximately  5 
dB  gain  following  the  output  of  the  digi  tal-to-analog  converter, 
l.e.,  17  dB  attenuation  before  filtering  and  22  dB  gain  following 
filtering.  An  attenuator  was  included  in  the  circuit  to  provide  for 
intensity  level  control.  Using  the  method  of  adjustment,  the  author 
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determined  hie  threshold  to  the  standard  impulse  to  be  at  93  dB 
attenuation.  It  Is  in  reference  to  this  setting  of  the  attenuator 
that  the  various  sensation  levels  used  in  the  experiment  are 
referenced.  The  experimental  results  are  tabulated  in  Table  4 
(Appendix  C)  and  are  illustrated  in  Figures  11  and  12. 

Oiscriminabi I i ty  may  be  observed  from  Figures  11  and  12  to 
be  strongly  dependent  on  intensity  for  a given  center  frequency, 
banduidth  and  delay.  This  general  finding  is  in  agreement  with  the 
previous  experiment  in  uhich  three  subjects  uere  tested.  The  widely 
separated  parametric  curves  of  Figure  11  indicate  a etrong 
dependence  on  delay  for  a given  bandwidth  whereas  the  close 
clueterlng  of  the  parametric  curves  in  Figure  12  indicate  little,  If 
any,  dependence  on  banduidth  for  bandwidths  greater  than  100  Hertz 
for  a given  intensity.  The  intensity  function  illustrated  in  the 
different  sub-figures  of  Figure  11  appear  to  be  roughly  linear  with 
slopes  that  appear  to  have  a slight  dependence  on  delay.  One  would 
expect  increased  discriminabi I i ty  as  a function  of  intensity 
inasmuch  as  the  dispersed  components  of  the  impulse  become  more  and 
more  audible  for  a given  dispersion  uith  an  increase  in  elgnal 
level.  Greater  delays  in  the  dispersed  frequency  band  have  already 
been  shown  to  bs  more  discriminable  than  shorter  delays.  It 
follows,  consequently,  that  the  slope  of  the  intensity  function 
should  be  dependent  on  the  given  delay.  That  the  stimuli  did  not 
produce  nonlinearities  is  implied  by  the  well  behaved  nature  of  the 
data.  Speech  stimuli  presented  at  the  same  levels  did  not  have  the 
buzzy  quality  characteristic  of  distortion  overloading. 
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Figure  11.  Oiacriminabi I i tg  of  Subject  UG  between  an  impulee  and 
phaee  modified  impulses  at  a function  of  intensity  with  delay  3* 
parameter. 
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Figure  12.  Oiscriminabi 1 1 ty  of  Subject  UG  between  an  impulse  and 
phase  modified  impulses  as  a function  of  intensity  with  bandwidth  as 
parameter. 
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01 ecr iminabi I i ty  as  a Function  of  the 
Psychophysical  Method 

It  Is  well  known  that  teat  results  are  a function  of  the 
testing  paradigm.  The  experimental  results  derived  thus  far  have 
been  derived  from  separate  tests  for  each  data  point  as  already 
discussed.  A second  test  series  in  which  more  than  one  stimuli  was 
randomly  presented  during  the  course  of  the  tests  was  conducted  by 
the  author  with  himself  as  subject  to  obtain  an  estimate  of  how  the 
results  might  be  affected  by  the  testing  method. 

Eight  separate  tests  wars  administered  to  reduce  the  length 
of  any  given  testing  session.  Each  test  tested  four  of  the  poseible 
16  phase  conditions.  The  four  phase  conditions  were  randomized 
across  ail  the  phase  conditions  and  enough  tests  were  administered 
to  obtain  two  estimates  for  each  of  the  phaae  conditions.  The  test 
conditions  were  the  same  as  the  last  test  except  for  the  testing 
method,  the  use  of  40  stimulus  pairs  per  condition,  ano  the 
presentation  of  the  stimuli  at  a constant  level  of  80  dB  sensation 
level.  The  average  of  the  two  estimates  for  each  condition  are 
tabulated  in  Table  5 (Appendix  C) . 

One  may  observe  by  comparing  the  results  tabulated  In  Table 
4 and  5 that  use  of  the  second  test  design  results  in  scores 
indicative  of  less  sensitivity  to  phase  changes  than  that  indicated 
by  the  first  test  design.  This  finding  is  not  unreasonable.  The 
eubject  is  better  able  to  concentrate  on  smaller  nuances  of 
difference  between  the  standard  stimulus  and  the  modified  stimulus 
when  only  one  phase  condition  is  being  contrasted  in  a given  test. 
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The  results  of  the  tuo  testing  designs  do  not  appear  to  differ 
• Ignl f leant  I y. 


Sensitivity  of  the  Author  to  Dispersion 
in  Impulses 

To  obtain  a more  complete  set  of  data  in  which 
dlscrlmlnabl 1 1 ty  uas  to  be  determined  as  a function  of  center 
frequency  as  uell  as  bandwidth  and  delay,  the  author  repeated  the 
firet  experiment  discussed  in  this  chapter  but  served  as  hie  own 
subject.  A second  reason  for  repeating  the  experiment  was  to  use 
stimuli  at  higher  intensities  similar  to  those  encountered  in 
conversational  level  speech.  A third  reason  for  repeating  the 
experiment  uas  to  obtain  some  measure  of  variability  of  the 
discrimination  function. 

The  experimental  design  uas  the  same  as  in  the  first 
experiment  except  that  the  stimuli  uere  presented  monaural ly  to  the 
right  ear  at  a sensation  level  of  80  dB.  Fourty  stimulus  pairs  uere 
presented  in  each  test.  The  intra-stimulus  delay  uas  approximately 
800  milliseconds.  The  vast  majority  of  the  individual  phase 
conditions  uere  randomly  tested  three  times  during  the  course  of  the 
experiment  to  obtain  an  estimate  of  the  variance  of  the 
dl scrlmlnabl 1 1 ty  function  for  the  particular  phase  condition.  In 
some  cases  where  discrimination  uas  obviously  perfect,  only  tuo 
teste  uere  performed  to  obtain  the  estimate.  As  many  as  five  tests 
were  used  where  variability  appeared  to  be  excessive.  All  the 
Individual  tests  for  ths  various  phase  conditions  uere  randomly 
administered.  The  test  results  are  tabulated  in  Table  6 (Appendix 
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C)  and  are  Illustrated  in  Figures  13  through  18.  The  Intent  of  the 
author  uat  to  discuss  the  data  trends  as  they  were  made  manifest  by 
plotting  the  data  as  functions  of  the  three  Independent  variables 
and  then  to  discuss  the  trends. 

Figure  13  and  14  represent  the  data  plotted  as  a function  of 
delay.  Both  figures  indicate  that  discrimination  is  strongly 
dependent  on  delay  for  a given  center  frequency  and  bandwidth.  The 
clustering  of  the  parametric  curves  in  Figure  13  indicates  a weaker. 
If  any,  dependence  of  discriminabi I i ty  on  bandwidth  for  the 
bandwidths  tested.  Note  that  in  the  sub-figures  of  Figure  13  that 
except  for  the  60  and  100  Hertz  parametric  curves  that  the  curves 
monotonlcal ly  Increase  to  the  upper,  left  of  the  sub-figures.  The 
general  clustering  and  monotonic  increase  indicate  a mild  but 
significant  dependence  of  discrimination  on  bandwidth.  The 
significance  of  the  exceptional  behavior  of  the  50  Hertz  curve  will 
be  discussed  in  the  next  section.  The  locus  of  the  parametric 
cluster  for  the  250  Hertz  sub-figure,  being  different  from  that  of 
the  other  sub-figures,  indicates  a dependence  of  di ecr 1 mi nat 1 on  on 
center  frequency.  This  dependence  on  center  frequency  is  made  more 
evident  in  Figure  14  where  the  data  is  plotted  with  center  frequency 
as  parameter.  In  every  case  where  the  250  Hertz  center  frequency 
curve  is  parametrically  plotted,  the  curve  is  seen  to  be  more  or 
less  isolated  from  the  other  parametric  curves.  The  clustering  of 
the  other  curves  indicate  a stronger  dependence  of  discriminabi  I I ty 
on  center  frequency  for  lower  frequencies  that  becomes  less 
significant  with  an  increase  in  frequency.  The  general  monotonlc 
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Increase  in  the  values  of  the  parametric  curves  in  all  the 
sub-figures  of  Figure  14  indicate  that  center  frequency  does  not 
become  insignificant,  however,  at  the  highest  frequencies. 

Figure  15  and  IS  represent  the  data  plotted  as  a function  of 
bandwidth.  In  gsneral,  the  curves  of  both  figures  are  slightly 
monotonical ly  Increasing  with  bandwidth.  The  wide  but  monotonic 
dispersion  of  the  parametric  curves  In  Figure  15  and  the  lack  of 
similarity  of  the  sub-figures  In  Figure  15  indicates  a etrong 
dependence  of  discrimination  on  delay.  The  0.0625  millisecond 
parametric  curve  in  all  the  sub-figures  of  Figure  15  except  for  the 
250  Hertz  sub-figure  appear  to  be  more  isolated  from  the  other 
parametric  curves  which  tend  to  be  more  clustered. 

Figure  17  and  18  represent  the  data  plotted  as  a function  ^bf 
center  frequency.  Both  figures  imply  a strong  dependence  of 
di scriminabi I i ty  on  center  frequency  for  a given  bandwidth  and 
delay.  The  lack  of  general  similarity  of  the  eub-figures  in  Figure 
17  and  the  dispersion  of  the  parametric  curves  in  each  sub-figure  of 
Figure  18  again  Indicates  a dependence  of  discrimination  on  delay. 
The  monotonic  but  moderate  clustering  of  curves  in  each  sub-figure 
of  Figure  17  and  general  similarity  of  the  sub-figures  of  Figure  18 
Indicate  a mild  but  significant  dependence  of  discrimination  on 
bandwidth.  The  0.0625  millisecond  parametric  curve  In  all  the 
sub-figures  of  Figure  18  Is  generally  isolated  from  the  other 
parametric  curves  which  tend  to  be  more  clustered  except  In  the  100 
Hertz  aub-figure.  This  isolation  Ib  similar  to  that  shown  in  Figure 
IS. 
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Figure  13.  01 ecr Imlnabl  1 1 tg  of  Subject  UG  between  an  Impulse  and 
phaee  modified  impulses  plotted  ae  a function  of  delag  with 
bandwidth  as  parameter. 
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Figure  14.  Di acr iminabi I i tg  of  Subject  UIG  between  an  impulse  and 
phase  modified  impulses  plotted  as  a function  of  delag  with  center 
frequencg  as  parameter. 
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Figure  15.  01 scr Iminabi  I I tg  of  Subject  WG  between  an  Impulee  and 
phase  modified  Impulses  plotted  a9  a function  of  bandwidth  with 
delag  as  parameter. 
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Figure  16.  01 •crlmlnabl 1 1 tg  of  Subject  UG  between  an  Impulee  and 
phaee  modified  impuleea  plotted  ae  a function  of  bandwidth  with 
center  frequencg  ae  parameter. 
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Figure  17.  01 acr iminabi  I I ty  of  Subject  UG  between  an  impulse  and 
phase  modified  impulses  plotted  as  a function  of  center  frequency 
with  bandwidth  as  parameter. 
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Figure  18.  Oiscriminabi  I i ty  of  Subject  UG  between  an  impulse  and 
phaee  modified  Impuleee  plotted  ae  a function  of  center  frequency 
with  delay  aa  parameter. 
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Discussion 

Results  for  the  500  Hertz  center  frequency  condition  ehoun 
In  Figure  13  ehould  be  noted  to  agree  with  those  of  the  three 
subjects  flret  tested  for  the  same  phase  condition  ehoun  In  Figure 
8.  It  appears  that  the  author's  phase  sensitivity  is  slightly  more 
acute  in  that  he  is  able  to  achieve  0.75  discrimination  betueen 
0.0625  and  0.125  milliseconds  of  delay  for  the  500  Hertz  center 
frequency  condition  as  opposed  to  0.125  and  0.25  milliseconds  as 
required  by  the  other  three  subjects.  This  is  no  doubt  due  to  hie 
greater  familiarity  uith  the  stimuli  and  all  of  their  nuances  of 
qual i ty. 

The  data  indicates  that  di scr iminabi I i ty  is  mildly,  yet 
significantly  dependent  on  banduidth.  01 scriminabl 1 1 ty  Is  perfect 
for  certain  values  of  center  frequency,  banduidth  and  delay. 
Di  ecr iminabi 1 1 ty  uould  fall  to  chance,  houever,  if  banduidth  uere 
reduced  to  zero.  One  uould  expect  that  discriminabi I i ty 
monotonical ly  increases  uith  banduidth.  The  values  selected  for 
banduidth  as  the  independent  variable  do  not  allou  the  observance  of 
the  monotonlc  increase  through  a "knee"  or  "point  of  inflection"  in 
the  function.  The  di scr iminabi I i ty  function  appears  to  have  reached 
an  upper  plateau  uhich  is  asymptotically  decreasing  for  the  selected 
values  of  banduidth.  The  displaced  position  of  the  50  Hertz 
banduidth  curve  in  Figure  13  and  17  from  an  otheruiee  monotonic 
ordering  ie  probably  due  to  its  relative  closeness  to  a point  of 
Inflection  in  the  function.  The  experiment  has  thus  demonstrated 
that  the  range  of  interest  for  banduidth  values  ie  lees  than  100 
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Hertz.  The  ubiquitous  critical  band  phenomenon  in  psychophyelcal 
research  may  account  for  the  leveling  off  of  dl ecr iminabl  I 1 ty  for 
banduidths  greater  than  100  Hertz. 

Envisioning  the  topology  of  a function  of  three  Independent 
variablee  is  difficult  at  best.  Inasmuch  as  the  effect  of  bandwidth 
for  the  values  selected  has  significantly  less  effect  on 
di ecr Imlnabi 1 1 ty  than  center  frequency  or  delay,  the  relative 
effects  of  center  frequency  and  delay  may  be  observed  by  averaging 
the  data  across  bandwidth  and  plotting  discrimination  as  a 
parametric  function  of  center  frequency  and  delay  as  Illustrated  In 
Figure  19.  The  figure  was  generated  by  first  drawing  smooth  curves 
by  eye  through  the  data  plotted  as  functions  of  center  frequency 
with  delay  as  parameter  and  as  functions  of  delay  with  center 
frequency  as  parameter.  Iso-contour  curves  of  the  parametric 
function  were  drawn  by  eye  through  points  generated  from  orthogonal 
projections  of  points  from  the  previously  drawn  curves. 

The  contours  Illustrated  in  Figure  19  appear  to  be  very  well 
behaved.  The  contours  would  appear  to  asymptotically  converge  with 
three  further  halflngs  of  center  frequency  at  approximately  31  Hertz 
or  the  lower  frequency  limit  of  hearing.  One  may  obeerve  from 
Figure  19  that  for  banduidths  greater  than  100  Hertz  and  for  a delay 
of  0.0625  milliseconds  that  a discrimination  score  of  0.75  is 
achieved  at  approximately  1000  Hertz  center  frequency.  For  a center 
frequency  of  250  Hertz,  the  0.75  score  is  achieved  at  approximately 
0.25  mill  I seconds. 
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Figure  19.  Relative  effect  of  center  frequency  and  delay  on 
di scrim i nab i I i ty. 

In  summary,  a series  of  experiments  was  conducted  to 
determine  the  di scr i mi  nab i I i ty  between  a standard  impulse  and 
impulses  in  which  a band  of  frequency  components  are  delayed.  The 
parameters  of  Interest  were  intensity,  center  frequency,  bandwidth 
and  delay.  Di scriminabi I i ty  was  shown  to  be  highly  dependent  on 
intensity.  At  a sensation  level  of  80  dB,  d: scr iminabi I i ty  was 
shown  to  be  mildly,  yet  significantly  dependent  on  bandwidth  for 
bandwidths  greater  than  100  Hertz.  The  contribution  of  center 
frequency  and  delay  to  di  scr  iminabi  I i ty  was  shown  to  be  inversely 
related.  Discrimination  scores  of  0.75  were  achieved  for  center 
frequencies  between  250  and  500  Hertz  at  delays  between  0.0625  and 
0.125  milliseconds. 
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The  phase  dl scrlmlnabl 1 1 ty  function  was  at  first  hoped  to  be 
determined  with  high  resolution  by  conducting  many  testa  In  uhleh 
the  parameters  of  interest  could  be  adjusted  over  a wide  range  in 
relatively  small  Increments.  Initial  attempts  by  the  author  in 
determining  the  range  of  interest  of  the  experimental  variables  uas 
frustrated  by  the  time  required  to  measure  accurately  a point  in  the 
function,  and  the  apparently  large  variability  of  the  function. 
Only  an  approximate  estimate  of  the  phase  di scr i mi  nab i I i ty  function 
must  consequently  suffice. 


CHAPTER  VII 


SENSITIVITY  TO  DISPERSION  IN  SPEECH 

Direct  assessment  of  the  effect  of  dispersion  on  continuous 
speech  is  difficult  because  of  the  short-time  spectral  complexity  of 
speech  and  the  constant  shift  in  short-time  spectra  during 
production  of  the  speech  signal.  The  difficulty  In  theoretically 
estimating  the  d i scr i minabi li ty  of  dispersion  based  on  short-time 
spectral  features  has  been  not  so  much  due  to  the  inherent 
complexity  of  the  spectra  Itself  as  to  lack  of  information 
concerning  the  sensitivity  of  the  ear  to  dispersion  in  general.  One 
purpose  of  the  experiments  discussed  in  Chapter  VI  uas  to  learn  more 
about  phase  sensitivity  so  as  to  make  possible  generalizations  about 
phase  modified  continuous  speech. 

Speech  consists  of  strings  of  concatenated  phonemes  whose 
short-time  spectra  are  modified  by  such  factors  as  articulation, 
coarticulation,  the  emotional  state  of  the  speaker,  etc.  The 
conetant  shift  In  the  short-time  spectra  of  speech  makes  assignment 
of  a phase  di scrlmlnabl 1 1 ty  statistic  to  continuous  speech  difficult 
because  of  the  varying  degree  of  dependence  of  phase 
dl  ecr imlnabi I i ty  on  each  successive  phoneme.  For  example,  one  would 
expect  less  sensitivity  to  dispersion  in  fricatives  because  of  their 
already  nolee-like  character i st ics  than  in  plosives  with  their 
sudden  and  relatively  simultaneous  onset  of  frequency  componente. 
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One  might  expect  that  dispersion  in  a spectral  region  in 
which  speech  has  statistically  little  energy  would  result  in  smaller 
overall  subjective  effects  than  would  dispersion  in  a spectral 
region  with  a statistically  higher  energy  concentration.  A tacit 
assumption  of  this  expectation  is  that  the  perception  of  quality 
change  is  due  to  cumulative  effects  for  each  phoneme  over  the 
phonetic  utterance.  One  could  hypothesize,  consequently,  that  a 
general  dl ecr Imlnabl 1 1 ty  statistic  for  continuous  speech  might  be 
derived  from  a weighted  sum  of  the  Individual  effects  of  dispersion 
on  the  constituent  phonemes.  On  the  other  hand,  quality  changes  may 
become  readily  apparent  if  only  one  or  two  relatively  "sensitive" 
phonemes  are  are  dispersed  only  slightly. 

Little  Is  Known  of  the  ear’s  phase  di ecr i mi  nab i I i ty  ability 
outside  of  the  well  known  fact  that  excessive  dispersion  In  speech 
results  In  chirp-like  sounds.  The  intent  of  the  experiments 
discussed  In  this  chapter  is  to  further  investigate  the  phase 
dl  ecrimlnabl 1 1 ty  phenomenon  as  It  pertains  to  speech  stimuli. 

Sensitivity  to  Dispersion  in  Phonemes 

The  major  Intent  of  the  experiment  to  be  discussed  in  this 
section  is  to  obtain  a set  of  psychophysical  data  which  is 
indicative  of  human  discriminabi I i ty  of  dispersion  of  the  type 
Introduced  in  Chapter  IV  for  various  classes  of  phonemes.  Such  data 
should  be  of  interest  in  directly  probing  the  speech 
discriminabi I i ty  mechanism  of  hearing  inasmuch  as  various  test 
results  derived  from  psychophysical  measurements  using  different 
classes  of  phonemic  stimuli  can  be  analyzed  in  relation  to  the 
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stimuli.  The  data  should  also  be  applicable  to  indirectly 
estimating  human  dl scr iminabi I i ty  of  dispersion  in  continuous 
speech.  The  phonemes  of  choice  used  as  stimuli  in  the  current 
experiment  were  selected  to  represent  three  classes  of  phonemes  and 
consequently  included  the  neutral  vowel  /<9/,  the  plosive  /t/  and  the 
fricative  / f / . These  sounds  were  obtained  from  digital  recordings 
as  discussed  in  Chapter  IV.  The  vowel  stimulus  was  obtained  from  a 
recording  of  the  vowel  spoken  in  isolation  whereas  the  plosive  and 
fricative  were  extracted  from  recordings  of  single  words  spoken  In 
Isolation.  The  phoneme  extraction  process  consisted  of  displaying 
the  signal  from  which  a phoneme  was  to  be  extracted  on  a storage 
oscilloscope  so  a9  to  determine  the  beginning  and  ending  sample  of 
the  phoneme  segment.  Auditory  playback  was  used  to  insure  the 
quality  of  the  Isolated  segment.  A trapezoidal  window  with  a rise 
and  decay  time  of  20  milliseconds  was  used  to  isolate  the  segment 
after  which  the  segment  was  copied  to  disk  as  a new  audio  file  for 
later  reference. 

The  phonemes  were  presented  In  the  dl scr Imlnat  i on  teste 
during  the  course  of  the  experiment  at  their  normal  conversational 
speech  level.  The  levels  were  determined  in  the  following  wayt 
Stimulus  thresholds  for  the  two  words  spoken  in  isolation  were 
determined  by  the  method  of  adjustment.  The  author  conducted  a 
series  of  tests  with  himself  as  subject  in  which  he  alternately 
decreased  the  setting  of  the  signal  attenuator  in  the  audio  playback 
circuit  until  the  stimuli  just  became  inaudible  and  then  increased 
the  setting  of  the  attenuator  setting  until  the  stimuli  just  became 
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audible.  The  average  attenuator  setting  was  approximately  90  dB. 
The  author  determined  that  both  stimuli  were  fully  Intelligible  at 
an  attenuator  setting  of  50  dB,  which  is  to  say,  a sensation  level 
of  40  dB.  The  level  at  which  the  attenuator  wae  set  to  produce 
"conversational  level"  speech  was  20  dB  for  a sensation  level  of  70 
dB.  The  20  dB  setting  was  used  throughout  the  course  of  the 
experiment.  The  method  of  adjustment  was  used  to  determine  the 
relative  levels  of  the  isolated  phonemes.  The  sensation  levels  of 
the  vowel,  plosive  and  fricative  were  determined  respectively  to  be 
70,  32  and  35  dB  in  reference  to  the  attenuator  setting  previously 
determined  for  conversational  level  speech. 

The  experiment  consisted  of  administering  a series  of 
discrimination  tests  for  each  phoneme.  Each  test  consisted  of  the 
random  presentation  of  16  unique  stimulus  pairs  for  discrimination. 
The  set  of  stimulus  pairs  peculiar  to  a given  test  consisted  of  one 
of  the  three  standard  phonemes  followed  by  one  of  the  16  phase 
modified  variants  of  the  standard  phoneme.  The  stimuli  of  all  16 
pairs  were  consequently  different.  The  author,  serving  as  his  own 
subject  in  the  experiment,  was  of  necessity  aware  that  each  pair  was 
different.  Control  stimulus  pairs  in  which  ths  stimuli  of  the  pair 
were  Identical  were  not  deemed  necessary  inasmuch  ae  the  author  wae 
motivated  not  to  make  random  responses  as  to  the  equality  of  the 
stimuli  of  the  pair.  The  validity  of  the  testing  paradigm  le 
attested  by  the  well  behaved  nature  of  the  data  when  plotted  and  by 
low  test  variance  when  repeated.  The  author  set  for  himself  the 
task  of  responding  that  the  stimuli  of  a pair  were  different  only 
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when  persuaded  that  the  stimuli  actually  were  different.  The  author 
responded  that  the  stimuli  were  the  same  If  no  difference  In  the 
stimuli  could  be  detected.  A discrimination  score  of  0.5  uas  chosen 
to  be  indicative  of  the  author’s  diecriminabi I i ty  inasmuch  as  ail 
the  stimulus  pairs  contained  different  stimuli. 

Each  of  the  16  phase  modified  variants  of  a standard  phoneme 
peculiar  to  a given  test  uas  randomly  presented  10  times  during  the 
course  of  a given  test.  The  pairs  uere  presented  monaural ly  to  the 
right  ear  at  the  sensation  levels  already  discussed.  The 
Intra-stimulus  Interval  of  each  stimulus  pair  uas  approximately  500 
ml  1 1 1 seconds. 

Four  values  each  of  the  three  independent  variables  uere 
chosen  to  generate  64  unique  phase  conditions  for  testing.  The 
center  frequencies  included  250.  500,  1000  and  2000  Hertz; 
banduidths  included  50,  100,  200,  400  and  800  Hertz;  and  delays 
Included  2,  4,  8,  16  and  32  milliseconds.  The  phase  of  the  standard 
vouel,  plosive  and  fricative  uere  modified  according  to  the  schedule 
just  described  to  generate  a set  of  stimuli  to  be  used  in  subsequent 
testing  of  dlscrlmlnabl I i ty. 

A test  ssrlss  consisting  of  four  tssts  as  already  dlecueeed 
uas  administered  for  each  of  the  three  phonemes.  Each  test  in  the 
eeriee  represented  a set  of  phase  conditions  in  uhich  the  center 
frequency  of  the  delayed  band  uas  constant.  The  16  pairinge 
represented  by  each  test  corresponded  to  the  unique  pairings  of  four 
values  each  of  banduidth  and  delay.  All  64  phase  conditions  could 
thus  be  tested  by  administering  the  four  tests.  The  order  in  uhich 
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the  teete  of  the  eerlee  uas  administered  uae  random.  The  teat 
eerlee  for  the  vouel  uae  administered  three  times  uhlle  those  for 
the  plosive  and  fricative  were  administered  tuice.  The  average 
dl  ecr iminabi I i ty  estimates  for  the  different  phase  conditions  are 
tabulated  in  Table  7,  8 and  3 (Appendix  C)  and  are  illustrated  in 
Figures  20  through  28.  The  intent  of  the  author,  as  in  the 
experiment  ulth  impulse  stimuli,  uae  to  determine  trends  in  the  data 
by  first  plotting  the  data  as  functions  of  each  of  the  three 
Independent  variables,  to  characterize  the  the  trends,  and  then  to 
discuss  the  characterizations.  Three  sets  of  curves  are  illustrated 
for  each  phoneme.  Each  set  of  curves  represents  the  data  as  plotted 
as  a function  of  each  of  the  three  independent  variables.  The  roue 
and  columns  of  curves  in  each  set  of  curves  represent  the  data  for 
various  parametric  values  of  the  remaining  independent  variables. 
For  example,  Figure  20  illustrates  the  di scr iminabi I I ty  of 
dispersion  in  the  vouel  phoneme  plotted  as  a function  of  delay*  The 
roue  parametrical Ig  represent  banduidth  and  the  columns  represent 
center  frequency.  Effects  of  parametric  variations  mag  be  observed 
bg  comparing  curves  across  rous  or  doun  columns. 

The  majority  of  the  curves  in  Figure  20  shou  that  phase 
dlscrlmlnabl 1 1 ty  in  the  phase  modified  vouel  is  mono ton i cal Ig 
dependent  on  delay  for  a given  center  frequency  and  banduidth.  A I I 
of  the  curves  shou  the  monotonic  increase  except  for  those 
representing  phase  conditions  of  500  Hertz  center  frequency  and  400 
or  800  Hertz  banduidth.  The  significance  of  the  tuo  exceptions  ie 
not  clear. 
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Figure  21.  Oi scriminabil i ty  of  Subject  UG  between  a standard 
neutral  vowel  stimulus  and  phase  modified  variants  of  the  vowel 
plotted  as  a function  of  bandwidth. 
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Figure  24.  01 scriminabi I i tg  of  Subject  UG  between  a standard 
plosive,  /t/,  and  phase  modified  variants  of  the  plosive  plotted  as 
a function  of  bandwidth. 
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Figure  27.  D1 scriminabi 1 i ty  of  Subject  UG  between  a etandard 
fricative,  / f / , and  phase  modified  variants  of  the  fricative  plotted 
at  a function  of  bandwidth. 
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The  locus  of  the  curves  Is  generally  displaced  more  and  more 
to  the  upper  left  as  columns  of  curves,  representing  parametric 
increases  in  banduidth,  are  considered  from  top  to  bottom.  The 
displacement  implies  an  increase  in  discriminabi I i ty  with  banduidth. 
Approximately  8 to  16  milliseconds  of  delay  is  required  for  0.6 
dl scr Imlnabl 1 1 ty  at  narrower  banduidths  whereas  only  2 to  4 
milliseconds  is  required  at  wider  bandwidths.  The  difference  in 
range  of  delays  is  most  likely  due  to  the  contribution  of  increased 
banduidth  on  di ecrlminabi I i ty.  The  locus  of  the  curves  is  displaced 
more  and  more  to  the  lower  right  as  roue  of  curves,  representing 
parametric  Increases  in  center  frequency,  are  considered  from  left 
to  right.  The  shifting  locus  implies  a slight  decrease  in 
di scriminabi I I ty  with  center  frequency.  The  contribution  of  center 
frequency  to  discriminabi  I i ty  appears  to  be  much  less  than  that  of 
delay  or  banduidth. 

The  curves  of  Figure  21  show  the  dependence  of 
di  ecrlminabi 1 1 ty  on  bandwidth  more  explicitly.  Approximately  200  to 
800  Hertz  banduidth  is  required  for  0.5  discriminabi I i ty  for  shorter 
delays  whereas  only  100  to  400  Hertz  bandwidth  Is  required  for 
longer  delays.  The  shifting  locus  of  curves  to  the  upper  left 
across  the  rows  of  curves,  representing  parametric  increases  in 
delay,  implies  the  dependence  of  discriminabi I i ty  on  delay.  The 
slight  shifting  of  locus  to  the  iower  right  down  the  column  of 
curves,  representing  parametric  increases  in  center  frequency,  shows 
again  the  slight  decrease  In  dl scr Imlnabl 1 1 ty  with  increased  center 
frequenciee. 
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The  slight  decrease  in  di ecrlminabi 1 l ty  with  increased 
center  frequency  is  shown  more  explicitly  in  Figure  22  where  center 
frequency  is  the  independent  variable.  The  relation  appears  to  hold 
only  for  bandwidths  less  than  800  Hertz.  0 i scr 1 mi  nab  I I I ty  appears 
to  peak  at  a center  frequency  of  1000  Hertz  for  bandwidths  of  800 
Hart*. 

The  majority  of  the  curves  of  Figures  23  and  2G  Indicate, 
respectively,  that  phase  dl scr iminabi I i ty  in  phase  modified  plosives 
and  fricatives  is  also  a function  of  delay.  The  shifting  locus  of 
the  curves  along  rows  and  columns  implies  a dependence  of 
dl ecrlminabi 1 1 ty  on  Increasing  bandwidth  and  decreasing  center 
frequency  as  in  the  case  of  the  vowel.  Approximately  4 to  8 
mlllieeconds  of  delay  in  the  plosive  is  required  for  0.5 
dl  scr Iminabi 1 1 ty  at  narrower  bandwidths  whereas  only  2 to  4 
milliseconds  are  required  for  wider  bandwidths.  From  16  to  32 
milliseconds  of  delay  is  required  in  the  case  of  the  fricative  for 
narrower  bandwidths  and  from  4 to  8 milliseconds  for  the  wider 
bandwidths.  Two  anomalous  curves  appear  in  Figure  23  for  the 
plosive  phoneme.  The  phase  condition  in  which  the  delay  is  centered 
at  500  Hertz  with  100  Hertz  bandwidth  appears  to  bs  much  more 
dl ecr iminable  than  phase  conditions  with  ths  same  bandwidth  but  with 
lower  or  higher  center  frequency.  The  phase  condition  in  which  the 
delay  is  centered  at  2000  Hertz  with  200  Hertz  bandwidth  appears  to 
be  less  di  scr  iminable  than  it  should  be  as  judged  by  the 
dl  scr Iminabi II ty  of  the  surrounding  phase  conditions. 

Ths  same  discussion  regarding  Figure  21  In  which  the 
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dl scrlmlnabi 1 1 ty  of  dispersion  in  the  vowel  was  plotted  as  a 
function  of  bandwidth  applies  to  Figures  24  and  27  In  which  the 
plosive  and  fricative,  respectively,  are  similarly  plotted. 
01  ecr I mi nabl I I ty  is  definitely  related  to  bandwidth.  The  ehifting 
locus  of  the  curves  along  rows  and  down  columns  imply  increased 
di  scr i ml  nab  I I I ty  with  an  increase  in  delay  or  a decrease  in  center 
frequency.  No  anomalous  phase  conditions  are  noted  as  In  Figure  23. 

Approximately  200  to  400  Hertz  bandwidth  is  required  for  0.5 
dl  scr Iminabi I i ty  for  shorter  delay  in  the  plosive  whereas  only  100 
to  200  Hertz  Is  required  for  longer  delays.  From  400  to  800  Hertz 
bandwidth  is  required  for  shorter  delays  in  the  case  of  the 
fricative  whereas  only  100  to  400  Hertz  Is  required  for  longer 
de I ays. 

The  slight  decrease  in  discriminabi I i ty  with  an  increase  in 
center  frequency  as  shown  in  Figures  25  and  28  for  the  plosive  and 
fricative,  respectively,  is  not  as  apparent  as  that  shown  in  Figure 
22  for  the  vouel.  The  plosive  data  In  Figure  25  shows  a general 
decrease  In  discriminabi 1 1 ty  of  dispersion  in  plosives  for  center 
frequencies  of  500  Hertz  and  greater.  The  decrease  may  be  partially 
mediated  by  the  decreased  contribution  of  a constant  bandwidth  of 
delay  to  discriminabi 1 1 ty  at  higher  center  frequencies  because  of 
the  logarithmic  frequency  characteristic  of  the  ear.  The  hypothesis 
ie  weakly  supported  in  the  following  wayi  Each  of  the  epectra  of  the 
three  speech  sounds  were  visually  searched  for  similarities  of 
spectral  magnitude  and  composition  at  two  adjacent  center 
frequencies.  Only  the  fricative  phoneme  magnitude  had  such 
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similarities  which  occurred  at  center  frequencies  of  250  and  500 
Hertz.  Ths  averags  magnituda  of  the  energy  centered  at  500  Hertz 
was  approximately  2 dB  less  than  that  at  250  Hertz  but  the  greater 
sensitivity  of  hearing  at  500  Hertz  could  compensate  for  the  lower 
energy  I eve  I • The  discrimination  scorss  for  the  various  conditions 
of  bandwidth  and  delays  centered  at  250  Hertz  are  closer  in 
magnitude  to  the  values  for  the  next  higher  bandwidth  centered  at 
S00  Hertz  than  to  ths  corresponding  values  of  bandwidth  centered  at 
500  Hertz. 

Interpretation  of  the  data  as  plotted  as  a function  of 
center  frequency  is  not  straight  forward.  The  changing  locus  of  the 
curves  across  rows  In  Figures  22,  25  and  28,  which  represent 
parametric  Increases  in  delay,  Is  presumably  due  to  the  effect  of 
delay  alone.  The  presumption  is  not  supported,  however,  because 
notwithstanding  a nebulous,  but  general  similarity  of  curve  shape 
across  rows,  especially  those  representing  bandwidths  less  than  800 
Hertz,  there  appears  to  be  Interesting  variations  In  the  shape  of 
the  curves.  For  example,  the  400  Hertz  bandwidth  row  for  the  vowel 

I . 

In  Figure  22  appears  to  reverse  its  trend  as  delay  increases  from  4 
to  8 milliseconds.  Significant  changes  across  rows  may  be  observed 
also  In  Figures  25  and  28  for  the  plosive  and  fricative, 
respectively,  but  none  as  significant  as  In  the  case  of  the  vowel. 
How  the  extent  of  delay  can  effect  seemingly  unsystematic  variations 
in  di ecrimlnabi 1 1 ty  is  not  clear. 

The  changing  locus  of  curves  down  the  columns,  representing 
parametric  increases  in  bandwidth,  is  presumably  due  to  the  effect 
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of  engaging  a greater  and  greater  portion  of  the  phoneme  epectra  in 
the  band  of  delayed  frequency  components.  The  curvee  appear  similar 
In  general  down  each  column  with  the  exception  of  the  curvee  at  the 
800  Hertz  bandwidth  condition.  Corresponding  columns  are  aleo 
similar  for  the  different  phonemes.  Corresponding  columns  are  more 

similar  betueen  the  plosive  and  fricative  than  betueen  either  the 

. I 

plosive  or  fricative  and  the  vowel.  The  spectra  of  the  plosive  and 

fricative  is  more  similar.  The  similarity  across  phonemes  suggest e 

that  the  discrimination  scores  are  partially  Independent  of  the 

phoneme  type.  The  dissimilarities  are  due  to  differences  In  the 

epectra.  The  di ecr iminabi I i ty  data  trend  for  the  vowel  at  the  800 

Hertz  bandwidth  phase  condition  appears  to  be  the  reverse  of  that 

for  other  bandwidth  conditions.  The  reversal  does  not  appear  for 

the  plosive  and  fricative.  The  reversal  in  the  case  of  the  vowel 

appears  to  be  due  to  a peak  occurring  a 1000  Hertz  center  frequency. 

The  peaking  is  probably  due  to  the  engagement  of  a vowel  formant 

when  bandwidth  is  increased  to  800  Hertz.  No  other  immediate 

correlation  between  dlscrlminabl  1 1 ty  and  epectra  as  Illustrated  in 

Figure  29  has  been  observed. 

In  summary,  dl acr iminabi I 1 ty  has  been  shown  to  be  a function 
of  delay  and  bandwidth  for  all  three  classes  of  phonemes 
Investigated.  The  rank  ordering  of  the  phonemes  In  decreasing  order 
of  sensitivity  to  dispersion  Is  plosive,  vowel  and  fricative.  For 
narrow  bandwidths  the  plosive  requires  4 to  8,  the  vowel  8 to  16  and 
the  fricative  16  to  32  milliseconds  of  delay  for  0.5 

dlscrlminabl 1 1 ty. 
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Figure  29.  Dagnitude  spectra  of  the  phonemes  used  in  the  study. 


For  wider  bandwldthe  the  respective  ranges  are  2 to  4,  2 to  4 and  4 
to  8 milliseconds.  For  shorter  delays  the  plosive  requiree  200  to 
400,  the  vowel  200  to  800  and  the  fricative  400  to  800  Hertz 
bandwidth  for  0.5  dl scriminabi I i ty.  For  longer  delays,  the  required 
banduldth  ranges  are  100  to  200,  100  to  400  and  100  to  400  Hertz 

c t 

respectively.  The  difference  in  range  for  different  bandwidths  or 
delays  is  probably  due  to  the  respective  contribution  of  increased 
delay  or  bandwidth  on  di scr iminabi I i ty. 

The  effect  of  quantization  noise  became  more  apparent  with 
phonemic  stimuli.  Such  noise  arises  when  the  samp  lee  of  a 
continuous  signal  are  quantized  to  one  of  a set  of  discrete  values 
for  storage  In  computer  memory.  The  quantization  noise  quality  was 
that  of  an  additive  tone.  The  noise  was  absent  in  the  standard 
stimuli.  One  could  argue  that  the  test  discriminations  would  be 
Invalid  inasmuch  as  a subject  could  discriminate  a difference 
between  the  two  stimuli  of  the  stimulus  pair  simply  by  the  presence 
of  quantization  noise.  The  author,  serving  as  hie  own  subject 
however,  was  aware  that  the  first  stimulus  of  each  test  pair  was  the 
standard  stimulus  and  that  the  stimuli  of  all  the  test  pairs  were 
different.  The  quantization  noise  was  clearly  evident  as  such  In 
most  of  the  stimulus  pairs  and  could  be  consciously  Ignored  with 
relatively  little  effort.  Strong  evidence  to  support  the  validity 
of  the  test  scores  is  the  Internal  consistency  of  the  test  scores 
and  the  external  consistency  of  the  test  scores  with  those  for  words 
and  sentences  as  will  be  discussed.  Further  support  of  the  results 
le  given  by  their  general  agreement  with  the  results  of  informal 
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dl Berlin ination  teete  in  uhich  a slightly  phase  modified  phoneme  with 
consequent  quantization  noise  uas  used  as  the  standard  stimuli.  A 
possible  approach  to  avoiding  the  quantization  noise  problem  Is  to 
add  a masking  noise  to  all  the  stimuli  used  in  the  experiment. 

Sensitivity  to  Dispersion  in  Uorde 

The  perceptual  quality  of  uorde  In  uhich  a band  of  frequency 
components  is  delayed  uas  observed  through  informal  experiments  to 
be  a function  of  delay  for  a given  center  frequency  and  banduidth. 
The  change  in  sound  quality  uas  observed  for  shorter  delays  at  the 
beginning  of  uords  uhile  the  change  in  quality  uas  observed  for 
longer  delays  over  the  total  duration  of  the  uord. 

One  uould  expect  that  the  onset  of  dispersed  uords  beginning 
with  sibilants  uould  be  less  discriminable  than  uords  beginning  uith 
plosives  because  of  the  noise-like  quality  of  sibilants.  The  author 
served  as  his  oun  subject  In  an  experiment  to  test  the  hypothesis. 
The  experiment  consisted  of  tuo  test  series,  one  for  the  uord  "DO" 
and  or.r»  for  the  uord  "SEE."  The  parameters  of  the  phase  distortion 
uere  center  frequency  of  500  Hertz,  banduidth  of  400  Hertz  and  time 
delays  of  8,  16,  32  and  64  milliseconds. 

Each  test  series  consisted  of  four  tests,  one  for  each  of 
the  four  phase  conditions.  The  order  of  the  test  presentation 
within  a series  uas  randomized.  Each  test  consisted  of  40 
randomized  pair  presentation  in  uhich  each  stimulus  of  the  pair 
could  be  either  the  standard  word  or  the  phase  modified  variant  of 
the  uord  as  already  discussed.  The  stimulus  pairs  uere  monaural  ly 
presented  to  the  right  ear  at  a "comfortable"  listening  level.  The 
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results  of  the  experiment  are  illustrated  in  Figure  30.  The  plosive 
onset  mag  be  seen  from  the  results  of  the  experiment  to  be  more 
dl  scrlmlnable  than  the  fricative  onset  which  supports  the 
hypothesis.  The  delays  required  for  0.75  dl scr iminabl I I ty  for  the 
two  initial  sounds  are  similar  to  those  found  in  the  phoneme 
experiment. 


DELAY  (MILLISECONDS) 

Figure  30.  Discriminabi I i ty  of  Subject  UG  to  phase  modified  words 
as  a function  of  delay. 

Sensitivity  to  Dispersion  in  a Sentence 
An  experiment  was  conducted  to  determine  the 
dl ecr Iminabl 1 1 ty  of  dispersion  in  the  test  sentence.  "Men  etrive  but 
seldom  get  rich."  The  three  female  subjects  introduced  in  Chapter  VI 
served  as  subjects.  A single  test  in  which  five  individual  phase 
conditions  were  randomly  tested  was  administered  twice  to  each  of 
the  subjects  to  obtain  two  discriminabi 1 1 ty  estimates  for  each  phase 
condition.  The  subjects  were  required  during  the  course  of  the  test 
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to  determine  whether  or  not  the  stimuli  of  a stimulus  pair  were  the 
"same"  or  "different.”  The  phase  modification  were  effected  by 
delaying  a 400  Hertz  band  of  frequency  componente  centered  at  500 
Hertz  for  different  values  of  delay  which  included  2,  4,  8 and  1G 
milliseconds.  The  stimulus  pairs  were  presented  monaural  ly  to  the 
subject’s  right  ear  at  a level  approximating  "conversational  level" 
speech.  Each  phase  condition  was  presented  10  times  during  the 
course  of  the  test  such  that  each  test  consisted  of  50  stimulus  pair 
presentations.  Half  of  the  test  pairs  for  each  phase  condition 
consisted  of  the  standard  sentence  followed  by  its  phase  modified 
variant  whereas  the  other  half  consisted  of  the  variant  followed  by 
the  standard  sentence.  The  stimuli  of  every  pair  was  consequently 
different  but  the  subjects  were  instructed  that  naif  of  the  pairs 
were  the  same.  This  expedient  was  adopted  to  reduce  the  excessive 
length  of  the  testing  period.  The  results  of  the  experiment  are 
1 I lustrated  in  Figure  31. 


Figure  31.  Olscriminabi  1 1 ty  of  subjects  AK,  MF  and  PA  to  phase 
modified  sentence  as  a function  of  delay. 
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0 1 scr i mi  nab i I i ty  is  seen  to  reach  the  O.G  level  for  delays 
In  which  a band  of  frequency  components,  centered  at  500  Hertz  and 
of  400  Hertz  bandwidth,  Is  delayed  between  4 and  8 ml  1 I i seconds. 
The  score  Is  similar  to  that  of  the  vowel  under  the  ear9  phase 
conditions  discussed  In  the  phoneme  experiment.  The  subjects 
apparently  cue  on  vowel  dispersions  when  discriminating  dispersion 
In  continuous  speech.  Greater  dispersions  are  apparently  tolerated 
In  continuous  speech  in  which  no  absolute  standard  le  available  for 
comparison.  The  certainty  of  this  cone  In?'; however,  ie  in  doubt 
because  of  the  lack  of  control  for  intensity  level  between  this 
experiment  and  that  for  the  phonemes  inasmuch  as  di  scriminabi I 1 ty 
has  been  shown  to  be  a function  of  intensity.  Absolute  judgments 
about  the  quality  of  the  vowel  sounds  may  be  the  deciding  factor  in 
whether  or  not  a given  phase  dispersion  is  perceived. 


CHAPTER  VI 11 


MONAURAL  SENSITIVITY  TO  OISPERSION  IN  IMPULSES  AND  SPEECH i 
SUMMARY  ANO  CONCLUSIONS 

A study  was  conducted  to  determine  the  limits  of  human  phase 
sensitivity.  In  review,  the  role  of  psychophysics  in  the  design  of 
speech  processing  systems  and  the  use  of  computer  techniques  in 
psychophysical  research  were  discussed.  The  history  of  the  "phase 
sensitivity  question"  was  revieued  and  its  nature  clarified.  In 
part,  Ohm  postulated  the  rule  by  uhich  the  ear  analyzes  sound  and 
Helmholtz  added  the  phase  rule  as  the  result  of  his  own  work.  The 
validity  of  Helmholtz*  phase  rule  was  discussed  and  shown  to  be 
correct  under  the  conditions  he  imposed.  Helmholtz  postulated  that 
the  ear  would  be  sensitive  to  phase  under  certain  conditions. 

The  work  of  investigators  from  the  time  of  Helmholtz  to  date 
was  reviewed  and  discussed.  The  majority  of  the  work  supports  the 
fact  that  the  ear  is  sensitive  to  phase.  Early  failures  in 
demonstrating  phase  sensitivity  were  probably  due  to  such  factors  ae 
equipment  limitations  and  theoretical  bias.  A general  intent  of 
much  of  the  early  work  was  the  demonstration  of  various  "phase 
af facta,"  The  relevance  of  such  uork  in  the  design  of  speech 
processing  systems  was  revieued  and  the  author's  philosophical 
approach  to  the  phase  sensitivity  question  was  introduced. 
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Part  of  the  author’s  contribution  Mas  the  philosophical 
approach  to  the  determination  of  the  limits  of  phase 

di  acrimlnabl I i ty  In  phase  distorted  speech  as  a pract  ical  goal  of 
phase  sensitivity  research.  The  application  of  computer  techniques. 
Including  discrete  Fourier  transforms,  to  signal  processing  uas 
discussed.  A theoretical  analysis  of  phase  distortion  was 
undertaken  to  provide  insight  into  the  nature  of  phase  such  that  the 
study  could  be  conducted  with  a solid  understanding  of  the 
phenomenon.  Two  types  of  experiments  were  described.  The  first  was 
concerned  with  di scr Iminabi 1 1 ty  of  dispersion  in  Impulses  and  the 
eecond  with  dispersion  in  speech  signals.  The  reBulte  of  the 

experiments  were  discussed  in  their  respective  chapters. 

The  results  of  the  study  cannot  be  compared  directly  with 
those  of  other  studies  because  of  basic  differences  in  the  stimuli 
and  the  type  of  phase  modification  used.  The  study  uae  not  designed  1 
to  explore  a new  "phase  effect"  but  to  determine  the  limits  of  phase 
dl  scrimlnabi I I ty  under  certain  conditions  as  functions  of  the 
parameters  of  an  Idealized  general  type  of  phase  distortion.  The 
general  exploratory  nature  of  the  study  preclude  extensive 
theoretical  Insight  into  ths  exact  mechanism  by  which  phase 
d i scr I m i nabi I i ty  is  effected.  No  general  predictions  are  stated 
based  on  the  results  of  the  study  because  of  i te  generally 
exploratory  nature. 

A number  of  criticisms  concerning  the  study  should  be 
discussed.  The  exact  characteristics  of  the  Koss  PR0-4A  headphones 
used  throughout  the  study  were  unknown.  It  was  felt  that  the  lack 
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of  Information  concerning  the  headphonee,  however,  wae  a minor 
problem  inasmuch  as  the  testing  paradigm  required  a discrimination 
between  two  stimuli  both  of  which  were  presented  with  the  same 
headphones.  Smaller  banduidths  should  have  been  used  in  the  impulse 
study.  Better  techniques  need  to  be  developed  to  analyze  and 
dleplay  data  of  three  or  more  independent  variables.  The 
theoretical  Implications  of  the  data  obtained  during  the  course  of 
this  study  have  not  been  exhausted  fully  simply  because  of  the 
complexity  of  the  analysis.  Better  computer  graphing  techniques 
should  be  developed  such  that  the  investigator  may  quickly  display 
the  data  In  various  aspects  of  the  independent  variables. 
Statistical  reduction  may  be  incorporated  profitably  in  the  graphic 
software. 

A possibly  interesting  variation  of  the  phoneme  experiment 
would  be  to  cepstrually  smooth  the  magnitude  of  the  phone ne  spectra 
eo  as  to  remove  any  confounding  effects  of  rapidly  varying  magnitude 
when  Investigating  the  effect  of  center  frequency  on 
d 1 scr I m I nab i I 1 ty.  The  investigator  should  have  normal  hearing  such 
that  he  may  adequately  investigate  phase  dl ecrimlnabl I 1 ty  using 
himself  as  a subject  before  conducting  extensive  teste  with  othe' 
subjects  so  as  to  best  utilize  computer  time.  Further  research  i i 
recommended  in  this  area  so  as  to  replicate  the  original  study  and 
to  further  investigate  its  ramifications. 

The  intent  of  the  author  has  been  to  supply  enough  general 
Information  to  permit  Independent  replication  of  the  study.  In 
review,  an  Investigator  must  obtain  a digital  recording  of  the 
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stimulus  to  be  tested;  determine  a eet  of  values  for  the  parameters 
of  the  phase  modifications  to  be  tested;  generate  the  filters 
necessary  to  effect  the  phase  modifications;  generate  a set  of  phase 
modified  stimuli  by  convolving  the  standard  stimulus  with  each  of 
the  filters;  and  finally  presenting  the  phase  modified  stimuli  to  a 
listener  under  a given  testing  paradigm.  All  of  the  software 
required  to  effect  the  above  has  been  written  by  the  author  as  an 
interactive  computer  program.  The  program,  GSYS.SAI,  and  its 
documentation  has  been  archived  with  the  SIPG  software  at  the 
University  of  Utah.  Independent  duplication  of  the  software  would 
require  considerable  time  and  expense.  Salient  features  of  the 
software  are  dlecueeed  in  Appendix  D. 


APPENDIX  A 


ON  THE  DIFFERENCE  IN  POPULAR  OPINION  AS  TO  HHAT  CONSTITUTES 

OH(1’S  LAU  OF  ACOUSTICS 

In  reviewing  the  literature  concerning  Ohm's  law  of 
acoustics,  It  becomee  apparent  that  there  may  be  differences  In 
opinion  aa  to  what  constitutes  the  law.  It  is  generally  accepted 
that  the  law  pertains  to  the  frequency  analyzing  ability  of  the  ear. 
Confusion  arises  as  to  whether  or  not  Ohm  (1843)  included  statements 
pertaining  to  the  insensitivity  of  the  ear  to  phase  relations  among 
the  harmonics  of  a complex  sound,  or  if  such  statements  were  later 
added  by  Helmholtz  (1863). 

To  obtain  an  estimate  of  the  extent  of  the  confusion  among 
authors  of  acoustical  literature,  a search  by  index  reference  to 
Ohm's  law  of  acoustics  was  made  of  pertinent  volumes  held  at  the 
University  of  Utah  main  library.  The  volumes  searched  were 
classified  under  Sound  (534)  and  Physiological  Audition  (612.85) 
within  the  Oewey  Decimal  System,  and  Sound  (QC  221-246), 

Physiological  Acoustics  (QP  461-471),  and  Literature  of  Music 
Acoustics  and  Physics  (ML  3805-3817)  within  the  Library  of  Congress 
System.  An  estimate  by  shelf  list  of  the  number  of  volumes 
comprising  these  sections,  made  by  the  librarian  at  the  current 
author's  request,  was  approximately  150  volumes.  Of  these  volumes, 
146  were  actually  searched.  Sixteen  volumes,  or  approximately  10 
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percent,  were  found  to  contain  the  index  reference  to  Ohm’s  law  of 
acoust I ce. 

The  16  volumes  represented  a range  In  publication  dates  from 
1877  to  1970  and  represented  the  writings  of  21  authors.  The 
statements  of  these  authors  concerning  Ohm's  law  are  tabulated  in 
Table  1. 

It  may  be  observed  from  Table  1 that  all  authors  made 
atatementa  concerning  the  frequency  analyzing  ability  of  the  ear  in 
reference  to  Ohm's  laui  there  Is  no  disagreement  among  authors  on 
this  point.  The  general  Interpretat ion  may  be  exemplified  by  a 
statement  by  hi  Her  (1916) « 

The  law  states!  all  musical  tones  are  periodic;  the  human 
ear  perceives  pendular  vibrations  alone  as  simple  tones;  all 
varieties  of  tone  quality  are  due  to  particular  combinations 
of  a larger  or  smaller  number  of  simple  tones;  every  motion 
of  the  air  which  corresponds  to  a complex  musical  tone  or  to 
a composite  mass  of  musical  tones  is  capable  of  being 
analyzed  into  a sum  of  simple  pendular  vibrations,  and  to 
each  simple  vibration  corresponds  a simple  tone  which  the 
ear  may  hear. 

It  la  generally  accepted  that  the  type  of  analysis  performed  by  the 
ear  is  a Fourier  analysis. 

Four  authors  made  statements  to  the  effect  that  Ohm's  law 
also  pertains  to  the  insensitivity  of  the  ear  to  phase.  Bekesy 
(I960),  citing  Helmholtz  and  Ohm,  stated  that  "According  to  Ohm's 
law,  the  perception  of  a complex  sound  depends  upon  its  analysis 
Into  single  tones  and  Is  entirely  independent  of  the  phase  relatione 
of  these  components  . . . ."  Bacus  (1969)  made  a similar  statement 
In  reference  to  Ohm’s  law  in  citing  Bekesy.  Rayleigh  (1877)  stated 
In  connection  with  hie  reference  to  Ohm's  law  that  ".  . . , 


within 
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the  limits  of  audibility  the  relative  phases  of  the  various 
components  would  be  a matter  of  indifference."  Stevene  and  Davie 
(1938)  wrote  that  the  usual  assertion  of  Ohm’s  law  ie  that  "the  ear 
tends  to  analyze  the  components  of  a complex  sound  regardless  of 
their  phase-relations." 

In  contrast  to  the  statements  of  the  four  authors  quoted 
above,  four  other  authors  implied  that  the  phase  sensitivity 
statements  were  later  added  by  Helmholtz.  Dittrich  (19G3)  wrote 
that  Helmholtz,  as  a result  of  his  experiments  with  synthetic 
sounds,  determined  that  "contrary  to  what  was  believed  up  to  thle 
time,  the  musical  timbre  of  a sound  depends  on  the  number  and  the 
Intensity  of  its  harmonics  and  not  on  their  phase  relation,  . . . 
Richardson  (1927)  also  Implied  that  phase  statements  were  not  part 
of  Ohm's  law  by  writing  that  "If  Ohm’s  law  be  true,  the  queetlon  le 
naturally  asked,  what  influence  have  the  respective  phases  of  the 
components  of  the  note  on  the  impression  of  its  quality?"  It  would 
appear  from  the  statement  that  the  effect  of  phase  on  quality  was 
not  part  of  Ohm’s  law.  In  support  of  this  conclusion,  Richardson  is 
further  cited  as  having  written  that  "Helmholtz  and  also  Kontg 
attacked  this  problem  experimentally." 

Uever  (1949)  wrote  that  Helmholtz  supported  his  resonance 
theory  of  hearing 

« • . from  his  experiments  on  the  synthesle  of  vowel e and 
other  complex  sounds.  Hs  showed  that  such  sounds  could  be 
Imitated  by  exciting  simultaneously  a number  of  tuning  forks 
suitably  chosen  in  frequency  and  intensity.  Only  frequency 
and  intensity  were  significant  variables  here;  the  phase 
relations  could  be  changed  at  will  without  noticeable 
alteration  of  what  he  referred  to  as  the  musical  quality. 

This  ineffectiveness  of  phase,  in  Helmholtz'B  view,  was  the 
result  of  the  fact  that  the  ear  analyzes  the  sound  into  its 
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component  simple  tones,  and  phase  relations  are  loet  In  the 
process. 

The  last  author  cited  to  Imply  that  Helmholtz  added 
statements  concerning  the  sensitivity  of  the  ear  to  phase  was  A. 
Uood  (1940),  who  stated  that  "Helmholtz  satisfied  himself  that 
relative  phase  Is  without  effect  on  the  quality  of  a note." 

Two  authors  definitely  stated  that  the  additions  were  made 
by  Helmholtz.  Lamb  (1910)  wrote  "According  to  Helmholtz  the 
Influence  of  phase  is  inappreciable."  Miller  (1916)  wrote  that  "Ae 
the  result  of  elaborate  investigation,  Helmholtz  added  the  following 
lawi  the  quality  of  a musical  tone  depends  solely  on  the  number  and 
relative  etrength  of  Its  partial  simple  tones,  and  In  no  respect  on 
their  differences  of  phase." 

In  support  of  the  statements  made  by  the  four  authors  just 
cited,  A.  B.  Uood  (1930)  categorically  stated  that  "Ohm’e  law  eaye 
nothing  about  the  relative  phases  of  the  harmonic  components  Into 

which  the  ear  analyses  the  sound " This  statement  is  in 

direct  conflict  with  those  of  Bacus,  Bekesy,  Rayleigh,  and  Stevens 
and  Oavls. 

The  Interpretat  ions  of  the  eight  authors  who  made  no 
raference  to  the  ear's  Insensitivity  to  phase  In  connection  with 
their  statements  of  Ohm’s  law  of  acoustics  is  not  clear.  It  ie  not 
unreasonable  to  think,  however,  that  the  lack  of  reference  implies 
that  their  Interpretation  pertains  only  to  the  frequency  analyzing 
ability  of  the  ear. 

In  summary,  19  authors  (or  col laboratore)  were  cited  who 
made  reference  to  Ohm’s  acoustical  law.  Of  these  authors  58  percent 
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made  statements  concerning  the  sensitivity  of  the  ear  to  phase  in 
reference  to  the  lau.  Of  this  group  36  psrcent  stated  that  Ohm's 
lau  Includes  phass  statements,  uhile  the  other  64  percent  stated,  or 
implied,  that  the  lau  does  not  include  phase  statements,  but  that 
the  statements  were  later  added  by  Helmholtz.  The  interpretat ions 
of  the  authors  uho  made  no  phase  statements  is  not  clear. 

Pertinent  journal  literature  uas  also  searched  to  further 
ascertain  the  extent  of  the  confusion  as  to  uhat  constitutes  Ohm's 
lau  of  acoustics.  Completeness  of  the  journal  survey  is  supported 
by  cross-referencing  among  the  articles.  Five  articles  are  cited 
uhlch  stats  that  Helmholtz  added  the  statements  concerning  phase 
indlffsrsncs  of  the  ear.  No  Journal  articles  uere  found  that  stated 
the  contrary. 

Beasley  (1931)  understood  that  the  effect  of  phase  upon  tone 
quality  uas  of  interest  even  before  Helmholtz'  research  but  urote 
that  Helmholtz  "announced  the  much-quoted  laui  'the  quality  of  a 
musical  tone  depends  on  the  number  and  relative  strength  of  its 
partial  simple  tones,  and  not  on  their  differences  of  phase.'" 
Beasley  referred  to  this  statement  as  Helmholtz'  "lau." 

Chapin  and  Firestone  (1934)  urote  that  "since  the  uork  of 
Helmholtz  It  has  been  uldely  accepted  that  the  tone  quality  of  a 

» 

comp  leu  tone  is  Independent  of  the  relative  phases  of  Its 
harmonics."  This  statement  incites  that  the  concept  originated  with 
Helmholtz. 

Schouten  (1939)  referred  to  the  frequency  analyzing  ability 
of  the  ear  as  Ohm's  lau  and  further  added  that  "According  to  a rule 
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proposed  by  Helmholtz  the  sound  perception  will  further  depend 
entirely  on  the  relative  Intensity  with  which  the  different 
components  occur  and  will  be  independent  of  the  relative  phase  of 
these  components." 

flathee  and  Miller  (1947)  stated  the  following  in  reference 
to  Helmholtz'  additions  concerning  the  sensitivity  of  the  ear  to 

phase i 

Ohm’s  acoustic  law  states  that  the  ear  analyzes  ;s  complex 
sound  into  simple  tones  independent  of  each  other. 
Helmholtz  as  the  result  of  carefully  conducted  experiments, 
amplified  this  statement  with  the  declaration  that 
".  . . the  quality  of  the  musical  portion  of  a compound  tone 
depends  solely  on  the  number  and  relative  strength  of  its 
partial  simple  tones,  and  in  no  respect  on  their  differences 
of  phase." 

Craig  and  Jeffress  (1962)  implied  Helmholt::'  additions 
pertaining  to  phase  by  writing  that  Helmholtz’  theory  did  not 
account  for  monaural  phase  effects  and  that  "he  avoided  the 
difficulty  by  denying  their  existence."  These  ajthors  cited 
Helmholtz  as  having  "classified  an  admittedly  heard  phase  determined 
auditory  effect  (the  faint  beats  produced  by  a tone  and  its  mietuned 
octave)  as  only  an  apparent  exception  to  hie  phase  rule." 

In  summary,  all  relevant  journal  articles,  checked  for 
inclusiveness  by  internal  cross-referencing,  indicated  unarimously 
that  Ohm’s  law  of  acoustics  doss  pertain  to  the  frequency  aralyzing 
ability  of  the  ear  and  that  Helmholtz  later  added  the  eti  tements 
concerning  the  phase  indifference  of  the  ear  as  a result  of  hie  own 
experimental  work. 

A careful  reading  of  Ohm  and  Helmholtz  reveale  that  Ohm  aid 
not  In  fact  mention  the  sensitivity  of  the  ear  to  phase  but  that 
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such  statements  uere  later  added  by  Helmholtz  as  a result  of  his  oun 
work. 

In  conclusion,  it  appears  that  the  vast  majority  of  authors 
from  the  time  of  Helmholtz  to  date  have  held  the  interpretation  of 
Ohm's  lau  summarized  immediately  above.  There  appear  to  be  a feu 
authors,  houever,  uho  hold  different  interpretat ions  euch  as  Bekesy, 
Rayleigh,  and  Stevens  and  Davis.  Uhether  or  not  these  authors  hold 
that  the  phrase  "Ohm’s  lau  of  acoustics"  includes  the  original 
statements  of  Ohm  plus  later  additions  by  Helmholtz  is  not  clear. 
It  ie  difficult  to  believe  that  these  authors,  in  particular,  could 
make  an  error  of  this  nature.  It  seems  more  likely  that  they  are 
using  the  phrase  "Ohm’s  lau  of  acoustics"  to  mean  in  a very  general 
sense  the  combined  contributions  of  Ohm  and  Helmholtz.  There  is 
precedence  in  euch  use  of  terms.  For  example,  Neuton’e  "universal 
lau  of  gravity"  is  correct  only  uithln  certain  limits.  Later 
contributions  by  Einstein  have  resulted  in  a more  correct  "universal 
lau  of  gravity,"  yet  the  phrase  "universal  lau  of  gravity”  la 
applied  uith  equal  correctness  to  the  concepts  of  Neuton,  the 
concepts  of  Einstein,  or  the  concepts  of  both  combined. 

It  uould  appear  then  that  in  referring  to  Ohm’s  lau  of 
acoustics  the  Investigator  is  at  liberty  to  apply  the  phrase  "Ohm's 
lau  of  acoustics"  in  a more  restricted,  generally  used,  and  correct 
aanse  In  reference  to  the  statements  of  Ohm,  or  In  a looser  sense  in 
reference  to  the  contributions  of  both  Ohm  and  Helmholtz.  The 
Investigator  uill  be  in  good  company  uith  either  use  of  the  phrase 
as  this  paper  has  demonstrated. 
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TABLE  1 

STATEMENTS  OF  SELECTED  AUTHORS  IN  REFERENCE  TO 
OHM’S  LAU  OF  ACOUSTICS 


Author 

Reference  Type  and  Page 

1 

2 

3 4 5 

6 

Bacus  (1969) 

99 

99 

Bekesy  (1960) 

471 

471 

Oenes  (1963) 

108 

X 

Dittrich  (1963) 

32 

33 

Lamb  (1910) 

2 

4 

Miller  (1916) 

62 

62 

Nordmark  (1970) 

58 

X 

Rayleigh  (1877) 

443 

443 

Richardson  (1927) 

232 

233 

Small  (1970) 

4 

X 

Stephens  and  Bates 

(1950)  7 

X 

Stevens  and  Oavis  (1938)  20 

203 

Tonedorf  (1970) 

232 

X 

Uard  (1970) 

438 

X 

Uever  (1949) 

26 

28 

Uever  and  Laurence 

(1954)  146 

X 

Whitfield  (1967) 

143 

X 

Uood.  A.  B.  (1930) 

356 

357 

Uood,  A.  (1940) 

340 

364 

1 Statements  that 

Ohm's  lau  pertains 

to  aural  frequency 

analysis. 

2 Statements  that 

Ohm’s  lau  also  pertains  to  the  ear’s 

insensitivity  to  phase. 

3 Statements  that  imply  that  Helmholtz  added  phase 


statements. 

4 Statements  that  Helmholtz  added  phase  statements. 

5 Statements  that  Ohm's  lau  does  not  include  any  reference 
to  the  sensitivity  of  the  ear  to  phase. 

6 No  reference  to  phase  in  reference  to  Ohm’s  law. 
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AN  INTRODUCTION  TO  DIGITAL  COMPUTERS 

A digital  computer  consists  basically  of  a memoru.  a 
processing  unit,  and  associated  peripheral  devices.  The  memory 
consists  of  a block  of  addressable  slots,  called  uorda.  in  which 
numbers  may  be  stored  or  retrieved.  The  processing  unit  acts  upon 
the  numbers  In  memory  to  perform  a logical  or  arithmetical  operation 
and  to  return  the  result  back  to  memory.  The  action  of  the 
processing  unit  Is  controlled  by  a sequential  list  of  instructions, 
called  a program,  which  is  also  stored  in  memory.  The  distinctive 
feature  of  a computer  that  makes  it  more  than  a programmed 
calculator  and  endows  it  with  its  computational  power  ie  that  the 
program  may  modify  itself  under  program  control. 

The  computational  power  of  computers  is  related  to  the 
number  or  instructions  in  their  repertoire,  the  speed  at  which  they 
may  execute  the  Instructions,  and  their  memory  elze.  Modern 
computere  are  able  to  execute  instructions  at  rates  approaching 
one-mil  I Ion  per  second,  and  have  memory  sizes  in  the  order  of  a 
quarter-mi  1 1 ion  words.  Memory  size  ie  colloquially  specified  In 
”K,"  where  one-K  represents  1024  words,  e.g.,  9G  < of  memory. 

Computers  uee  a binary  (base-2)  number  system  to  Internally 
represent  numbers  because  of  the  ease  in  which  such  numbers  may  be 
electronically  represented.  Only  two  digits  are  allowed  In  the 


120 


binary  number  system,  0 and  1.  The  digits  may  be  represented,  for 
example,  by  the  state  of  a bistable  electrical  device  such  as  a 
switch.  A ten  state  switch  would  be  required  to  represent  a digit 
In  the  base-10  number  system.  Binary  digits  are  colloquially 
referred  to  as  bi  ts.  In  any  given  computer,  each  memory  word 
contains  a given  number  of  bits.  The  greater  the  number  of  bite, 
the  larger  the  number  that  can  be  stored. 

Computers  have  a large  variety  of  associated  peripheral 
devlcee.  Many  of  the  devices  are  for  I nput-output  euch  ae 
teletypes,  high-speed  line  printers,  oecl  I loecopes, 
analog-to-digi tal  converters,  digi tal-to-analog  converters,  etc. 
Other  peripherals  include  magnetic  tape  and  disk  mass  storage 
devices.  The  data  base  for  a given  computer  problem  is  typically 
greater  then  what  can  be  stored  at  one  time  in  memory.  The  solution 
to  the  problem  is  to  store  the  data  base  on  disk,  sequentially  read 
Into  computer  memory  only  as  much  of  the  data  that  can  be  handled, 
process  the  data,  and  then  sequentially  wr  I te  the  data  back  out  to 
disk. 

Computer  applications  appear  to  be  limited  only  by  the 
complexity  of  the  program  which  a programmer  may  devlee,  Ite  epeed 
and  memory  eize,  and  the  character! st ics  of  its  input-output  system. 
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DATA  TABLES 


TABLE  2 

01 SCR I Ml  NAB I LI TY  OF  SUBJECTS  AX,  MF  AND  PA  BETWEEN  AN 
IMPULSE  AND  IMPULSES  IN  WHICH  A BAND  OF  FREQUENCY 
COMPONENTS  ARE  DELAYED  AS  FUNCTIONS  OF  BANDWIDTH  AND 

DELAY. 


BANDUJOTH  DELAY  (MILLISECONDS) 


(HERTZ) 

0.0625 

0.125 

0.25 

0.5 

SUBJECTS 

AK 

100 

0.47 

0.64 

0.63 

0.85 

200 

0.56 

0.66 

0.67 

0.91 

400 

0.55 

0.63 

0.84 

0.91 

800 

0.50 

0.58 

0.82 

0.94 

SUBJECTS 

MF 

100 

0.50 

0.49 

0.65 

0.76 

200 

0.52 

0.71 

0.68 

0.85 

400 

0.48 

0.53 

0.68 

0.90 

800 

0.48 

0.61 

0.79 

0.73 

SUBJECTS 

PA 

100 

0.53 

0.53 

0.60 

0.93 

200 

0.52 

0.46 

0.69 

0.84 

400 

0.54 

0.50 

0.54 

0.98 

800 

0.52 

0.52 

0.94 

0.97 
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TABLE  3 

01  SCR  I MI  NAB  1 L I TY  OF  SUBJECTS  AK,  MF  AND  PA  BETUEEN  AN  IMPULSE  ANO  A 
PHASE  MOOIFIEO  IMPULSE  AS  A FUNCTION  OF  INTENSITY.  THE  PARAMETERS 
ARE  500  HERTZ  CENTER  FREQUENCY.  400  HERTZ  BANDUIOTH  AND  0.2S 

MILLISECONDS  DELAY. 


SUBJECT 

RELATIVE 

-6 

INTENSITY 

0 

(DB) 

AK 

O.SS 

0.G7 

0. 

MF 

0.49 

0.68 

0. 

PA 

0.48 

0.49 

0. 
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TABLE  4 

□ I SCR  1 1*1 1 NAB  1 L I T Y OF  SUBJECT  UG  BETUEEN  AN  IMPULSE  ANO  PHASE 
MODIFIED  IMPULSES  AS  A FUNCTION  OF  INTENSITY.  PHASE  IS  MODIFIED 
AS  FUNCTIONS  OF  BANDWIDTH  AND  DELAY. 


DELAY  (mSec) 

68 

71 

INTENSITY 

74 

(dB  SL) 
77 

80 

83 

BU  - 100  HERTZ 

0.0625 

0.125 

0.500 

0.617 

0.550 

0.700 

0.725 

0.725 

0.783 

0.850 

0.25 

0.550 

0.650 

0.817 

0.983 

0.908 

0.983 

0.5 

0.650 

0.917 

0.967 

1.000 

BU  - 200  HERTZ 

0.0625 

0.467 

0.550 

0.750 

0.792 

0.125  0.517 

0.567 

0.671 

0.839 

0.922 

0.894 

0.25  0.567 

0.5  0.917 

0.583 

0.933 

0.833 

0.983 

0.992 

1.000 

0.983 

1.000 

BU  - 400  HERTZ 

0.0625  0.467 

0.500 

0.517 

0.625 

0.750 

0.833 

0.125  0.508 

0.489 

0.794 

0.850 

0.972 

0.25  0.561 

0.5  0.917; 

0.817 

1.000 

0.894 

0.992 

0.933 

0.961 

BU  - 800  HERTZ 

0.0625 

0.467 

0.517 

0.633 

0.783 

0.125  0.483 

0.550 

0.600 

0.750 

0.867 

0.850 

0.25  0.467 

0.5 

0.683 

1.000 

0.950 

0.950 

1.000 

1.000 
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TABLE  5 

OISCRiniNABILITY  OF  SUBJECT  UG  TO  DISPERSION  IN  IMPULSES  AS  A 
FUNCTION  OF  BANOUIOTH  AND  OELAY  FOR  A FIXED  CENTER  FREQUENCY  OF  500 
HERTZ  AND  SENSATION  LEVEL  OF  80  DB  USING  A CLASSICAL  CONSTANT 

STIMULI  EXPERIMENTAL  DESIGN. 


OELAY  (mSEC) 

100 

BANOUIOTH 

200 

(HERTZ) 

400 

800 

0.0625 

0.51 

0.55 

0.58 

0.68 

0.125 

0.52 

0.61 

0.79 

0.92 

0.25 

0.76 

0.92 

0.95 

0.96 

0.5 

0.89 

0.98 

1.00 

1.00 
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TABLE  6 

MEAN  ANO  STANOARD  DEVIATION  OF  THE  OISCRW1INABILITY  OF  SUBJECT  UG 
BETUEEN  AN  IMPULSE  ANO  IMPULSES  IN  UHICH  A BAND  OF  FREQUENCY 
COMPONENTS  ARE  DELAYED  AS  FUNCTIONS  OF  CENTER  FREQUENCY,  BANDUIOTH 

ANO  DELAY. 


DELAY 

BANDUIOTH  1 

[HERTZ) 

(mSEC) 

50 

100 

200 

400 

800 

1600 

FC  - 250 

HERTZ 

0.0625 

0.4833 

0.4667 

0.5500 

0.6000 

0.0204 

0.0540 

0.0354 

0.0707 

0.125 

0.5250 

0.5417 

0.6167 

0.7333 

0.0354 

0.0816 

0.1633 

0.1080 

0.25 

0.7833 

0.7417 

0.8917 

0.9500 

0.0816 

0.1744 

0.2041 

0.0612 

0.5 

0.9000 

0.9S83 

0.9583 

0.9917 

0.0354 

0.0204 

0.0408 

0.0204 

FC  - 500 

HERTZ 

0.0625 

0.4833 

0.6500 

0.6500 

0.7083 

0.6278 

0.0204 

0.1841 

0.1225 

0.1124 

0.0068 

0.125 

0.8500 

0.7333 

0.9222 

0.9722 

0.9056 

0.2121 

0.0408 

0.0360 

0.0136 

0.0593 

0.25 

1.0000 

0.9222 

0.9722 

0.9611 

0.9917 

0.0000 

0.0491 

0.0272 

0.0758 

0.0204 

0.5 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.0000 

0.0000 

3.0000 

0.0000 

0.0000 

FC  - 1000  HERTZ 

0.0625 

0.5750 

0.8417 

0.7250 

0.8083 

0.7750 

0.9125 

0.0354 

0.2131 

0. 1275 

0.1242 

0.0354 

0.0530 

0.125 

0.9083 

0.8583 

0.8750 

0.9917 

0.9S00 

0.9875 

0.0540 

0.1671 

0.1061 

0.0204 

0.0707 

0.0177 

0.25 

1.0000 

0.9917 

0.9917 

1.0000 

1.0000 

1.0000 

0.0000 

0.0204 

0.0204 

0.0000 

0.0000 

0.0000 

0.5 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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TABLE  6 (CONTI NUEO) 


OELAY 

BAN0U10TH 

(HERTZ) 

(MSEC) 

SO 

100 

200 

400 

800 

1600 

3200 

FC  - 2000  HERTZ 

0.062S 

0.6063 

0.7100 

0.7375 

0.9063 

0.8625 

0.8813 

0.7250 

0.1293 

0.2706 

0.1920 

0.1083 

0.1031 

0.1139 

0.1768 

0.12S 

1.0000 

0.8563 

0.9417 

0.9500 

0.9667 

0.9750 

0.9167 

0.0000 

0.1S72 

0. 0540 

0.1458 

0.0408 

0.0354 

0.0408 

0.2S 

1.0000 

0.9833 

1.0000 

1.0000 

1.0000 

1.0000 

1 . 0000 

0.0000 

0.0204 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

O.S 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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TABLE  7 


OISCRIMINABILITY  OF  SUBJECT  UG  BETUEEN  A 

STANDARD  NEUTRAL 

VOUEL 

STIMULUS  AND  PHASE  MODIFIED  VARIANTS  OF 

THE  THE  VOUEL 

IN 

UHICH  A 

BAND 

OF  FREQUENCY  COMPONENTS 

ARE  DELAYEO  AS  FUNCTIONS 

OF 

CENTER 

FREQUENCY, 

BANDUIOTH,  ANO 

OELAY. 

BANGUI  IOTH 

OELAY  (MILLISECONDS) 

(HERTZ)  2 

4 

8 

16 

FC  - 

250  HERTZ 

SO 

0.000 

0.000 

0.000 

0.000 

100 

0.000 

0.000 

0.067 

0.333 

200 

0.000 

0.067 

0.167 

0.833 

400 

0.000 

0.400 

0.800 

0.933 

FC  - 

500  HERTZ 

100 

0.000 

0.000 

0.167 

0.200 

200 

0.033 

0.033 

0.333 

0.600 

400 

0.000 

0.567 

0.267 

0.700 

800 

0.200 

0.800 

0.467 

0.533 

FC  - 

1000  HERTZ 

100 

0.000 

0.033 

0.067 

0.050 

200 

0.000 

0.033 

0.300 

0.400 

400 

0.033 

0.267 

0.333 

0.700 

800 

0.333 

0.967 

1.000 

1.000 

FC  - 

2000  HERTZ 

100 

0.000 

0.033 

0.067 

0.133 

200 

0.0G7 

0.033 

0.100 

0.400 

400 

0.067 

0.167 

0.633 

0.767 

800 

0.067 

0.467 

0.900 

0.967 

l 

i 
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TABLE  8 

D I SCR I HI  NAB I L I T Y OF  SUBJECT  UG  BETUEEN  A STANOARO  PLOSIVE,  /t/,  AND 
PHASE  MODIFIED  VARIANTS  OF  THE  PLOSIVE  IN  UHICH  A BAND  OF  FREQUENCY 
COMPONENTS  ARE  OELAYEO  AS  FUNCTIONS  OF  CENTER  FREQUENCY,  BANDUIQTH, 

AND  DELAY. 


BANOUIOTH 

DELAY 

(MILLISECONDS) 

(HERTZ)  2 

4 

8 

16 

FC  - 250  HERTZ 

50  0.050 

0.000 

0.000 

0.000 

100  0.000 

0.000 

0.050 

0.050 

200  0. 100 

0.500 

0.950 

1.000 

400  0.100 

0.500 

0.850 

1.000 

FC  - 500  HERTZ 

100  0.050 

0.150 

0.950 

1.000 

200  0.050 

0.100 

0.900 

1.000 

400  0.400 

0.800 

1.000 

1.000 

800  O.SOO 

0.900 

1.000 

1.000 

FC  - 1000  HERTZ 

100  0.100 

0.050 

0.100 

0.300 

200  0.050 

0.150 

0.250 

0.850 

400  0.150 

0.000 

0.400 

0.950 

800  0.200 

0.750 

0.900 

1.000 

FC  - 2000  HERTZ 

100  0.050 

0.067 

0.100 

0.100 

200  0.150 

0.133 

0.050 

0.050 

400  0.050 

0.067 

0.350 

0.950 

800  0.300 

0.500 

1.000 

1.000 
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TABLE  9 

OISCRIMINABILITY  OF  SUBJECT  UG  BETWEEN  A STANDARD  FRICATIVE,  /f /, 
AND  PHASE  nODIFIEO  VARIANTS  OF  THE  FRICATIVE  IN  WHICH  A BAND  OF 
FREQUENCY  COMPONENTS  ARE  DELAYED  AS  FUNCTIONS  OF  CENTER  FREQUENCY, 

BANDWIDTH.  ANO  DELAY. 


BANOWIOTH 

DELAY 

(MILLISECONDS) 

(HERTZ) 

4 

8 

16 

32 

FC  - 250 

HERTZ 

SO 

0.000 

0.000 

0.000 

0.000 

100 

0.100 

0.150 

0.100 

0.850 

200 

0.300 

0.100 

0.850 

1.000 

400 

0.050 

0.200 

0.900 

1.000 

FC  - SOO 

HERTZ 

100 

0.100 

0.000 

0.300 

1.000 

200 

0.050 

0.050 

0.300 

1.000 

400 

0.400 

0.250 

0.850 

1.000 

800 

0.550 

0.550 

1.000 

1.000 

FC  - 

1000  HERTZ 

100 

0.000 

0.000 

0.000 

0.600 

200 

0.033 

0.000 

0.550 

1.000 

400 

0.000 

0.167 

1.000 

1.000 

800 

0.033 

0.167 

0.950 

1.000 

FC  - 2000  HERTZ 

100  0.000 

0.000 

0.050 

0.050 

200  0.067 

0.000 

0.150 

1.000 

400  0.067 

0.200 

• 0.750 

1.000 

800  0.233 

0.650 

0.850 

1.000 
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SELECTED  SOFTUARE  DESCRIPTION 
GSYS 

CSYS  Is  an  interactive  computer  program  used  to  process, 
replay,  and  display  audio  signals,  and  to  administer  psychophysical 
tests.  The  program  is  written  in  SAIL,  an  ALGOL-like  high-level 
computer  language.  The  program  types  a herald  on  the  controlling 
terminal  to  indicate  its  readinsss  to  accept  a command.  After 
executing  any  given  command,  the  program  again  typee  the  herald  and 
re-enters  Its  waiting  state.  A list  of  the  available  commande  may 
be  obtained  by  typing  a question-mark.  Typing  the  first  character 
of  a command  results  in  the  automatic  typing  of  the  rest  of  the 
command  name.  The  program  will  request  parameter  value  input  if 
required  by  the  given  command.  The  program  is  organized  around  a 
large  program  buffer  which  is  used  to  store  audio  data  for 
proceseing,  replay,  etc.  The  audio  data  are  organized  ae  audio 
f I lee  for  manipulation.  Provieions  are  made  for  the  absolute 
accessing  of  audio  data  on  disk.  A description  of  the  audio  file 
eyetem  follows  under  the  title  of  FILES.  System  commands  Include 
the  follow I ngi 

AVGOUT  Remove  any  DC  bias  from  the  audio  data  stored  In  the 

program  buffer. 

CONVOLVE  Convolve  a given  source  file  with  a given  kernel  file.- 
The  resulte  are  left  in  the  program  buffer. 
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DRIVE 

EXPUNGE 

FREQUENCY 

GET 


Change  the  default  drive  number. 

Oelete  an  audio  file  stored  on  disk. 

Set  the  frequency  of  the  digi  tal-to-analog  converter. 
Copy  an  audio  file  from  disk  to  the  program  buffer. 


INPUT 

KERGEN 

LIST 

MAKE 

NUOROS 

ONOFF 

PLOT 

RMS 

SAVE 

TEST 

ZERO 

1I0X 

21DX 

ST’S 

•PLAY 


Copy  audio  data  from  disk  to  the  program  buffer  using 
absolute  disk  addressing. 

Generate  the  frequency  domain  representation  of  the 
idealized  filters  used  in  the  study. 

List  the  disk  file  directory. 

Generate  special  audio  signals  in  the  program  buffer. 

Specify  the  number  of  uorde  of  data  in  the  program 
buffer  in  reference  to  the  buffer’s  louer  index  to  be 
used  in  processing,  replay,  etc. 


Uindou  the  audio  data 
trapezoidal  uindou. 

in 

the 

program  buffer 

ui  th 

a 

Display  the  audio  data 

in 

the 

program  buffer 

on 

the 

storage  oscilloscope. 
GMAC. 

See 

the 

documentation  for 

PLOT 

i n 

Type  the  RMS  value  of 

the  data 

in  the  program  buffer. 

Copy  the  program  buffer  to 

a disk  file. 

Psychophysical  test.  See  the  documentation  for  IAFCT  In 
GMAC. 

Clear  the  program  buffer. 

Set  the  program  buffer's  lower  index. 

Set  the  program  buffer’s  upper  index. 

Oesign  a filter  using  the  5T*e  method.  The  KERGEN 
command  is  first  called  to  specify  the  idealized  filter 
character  I at  ice. 

Play  the  data  in  the  program  buffer  through  the 
digi tal-to-analog  converter. 
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GMAC 

GflAC  is  the  author's  Assembly  language  subroutine  library. 
The  subroutines  are  used  by  GSYS.  The  following  subroutines  are 
selected  for  description! 


SUBROUTINE  IAFCT (DRIVE, X.RLST) 


IAFCT  is  an  interactive  audio  file  comparison  test  subroutine.  It 
presents  a number  of  audio  file-pairs  for  discrimination  testing 
through  0A16A.  The  following  commands  are  provided! 

BEGIN  TEST  Start  the  test  program. 

OELAY  Set  the  time  delay  between  signal  pairs  and  between 

signals  of  the  pair. 


INDEX 


Set  the  beginning  index  into  the  signal-pair 
randomizer. 


LIST  RESULTS  Type  a list  of  the  signal-pairs  presented,  the 
number  of  times  each  pair-type  was  presented,  and 
the  number  of  correct  discriminations  for  each 
pair-type. 

NUMBER  Set  the  number  of  times  that  the  signal-pair  list  is 

randomized  and  presented. 


REF  TABLE 


Input  a list  of  signal-pairs  from  the  terminal  to  be 
presented  in  the  test.  Terminate  the  list  with  -1. 


TTY  MOOE 
? 


TRUE  (-l)i  Get  listener's  response  from  the  TTY  for 
each  signal-pair  presentation.  FALSE  (0)  i Present 
the  whole  test  without  TTY  interaction. 

Type  the  command  list. 


SUBROUTINE  APF1 (X. Y,PUR2,FC,BU,TD,SFREQ) 


APF1  specifies  the  frequency  domain  characteristics  of  an  ideal 
filter  designed  to  give  a constant  time  delay  (TO)  to  a band  of 
frequency  components  (BUI  centered  around  a given  center  frequency 
(FC).  The  magnitude  is  returned  in  X,  and  the  phase  in  Y. 
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SUBROUT  I NE  CONV (X , PUR2 . NREC , RPK , RBLK , UPK , UBLK ) 

CONV  performs  linear  convolution.  It  assumes  that  the  kernel 
reeidee  in  the  first  N words  of  array  X,  where  N-2PUR2.  The 
eubroutlne  types  the  record  number,  the  input  disk  block,  the  output 
disk  block,  and  the  number  of  packing  overflows  for  each  record. 


SUBROUTINE  PLOT (N.NUORDS, DELTA) 

PLOT  la  an  Interactive  plotting  subroutine  with  the  following 
commands* 

C Grid  complexity. 

E Expand  and  plot  with  one-shot  settings  of  grid  coordinates, 

H Horizontal  label. 

I Intensity. 

fl  Compute  new  Y-coordinates  for  each  plot. 

0 Offset  Y-coordinates  from  average. 

P Plot. 

T Grid  type. 

U Multiple  plot  on  same  grid  image. 

V Vertical  label. 

Y Y-coordlnatee  from  the  TTY. 

? Type  this  list. 

<carrlage  return>  Subroutine  return. 


FILES 

FILES  is  a simple  file  system  designed  to  facilitate  the 
storage  and  retrieval  of  data  on  private  disk  packs.  The  system 
formate  a given  pack  by  storing  a directory  image  on  the  disk  pack 
beginning  at  block  200.  The  first  two  words  of  the  directory  image 
serve  as  the  directory  header  with  the  following  format: 

WORD  Oi  Left  justified  SIXBIT  key-word  "AUDIO." 

U0R0  li  Left  justified  SIXBIT  pack  name. 
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Following  the  directory  header  are  1023  2-uord  file  deecriptor 
blocks  with  the  following  formati 

UORD  Oi  LHi  Ueer  supplied  data. 

RH:  File  block  location. 

U0R0  It  BITS  0-12:  Date  of  file  creation. 

BITS  13-35:  Length  of  file  (UOROS). 

Data  is  stored  on  disk  beginning  at  block  1000.  Data  may  be  stored 
up  to,  and  Including,  block  39999.  The  files  on  a given  pack  are 
referred  to  by  a file  number  ranging  from  1 to  1023.  Uhen  a new 
file  Is  created  the  system  finds  and  returns  the  first  free  block 
number  above  the  file  occupying  the  highest  numbered  block. 


SUBROUTINE  SETDIR (ERROR, DRIVE, NAME)  formats  a given  disk  pack  for 
the  file  system  and  clears  the  directory. 

SUBROUTINE  OELF (ERROR, DRIVE, FI LNUM)  deletes  a file  from  the 
directory. 

SUBROUTINE  DIR (ERROR, DRIVE, MODE)  types  the  directory  Information  on 
the  TTY. 

SUBROUTINE  ERR(ERROR)  types  the  error  message  associated  with  an 
error  number  on  the  TTY.  ERR  is  a NO-OP  if  ERROR-O. 

SUBROUTINE  0PENR(ERR0R, DRIVE. FlLNUtl. BLOCK, NUORDS, DATA)  returns  disk 
addressing  information  and  data  pertaining  to  a given  file  number. 
Used  to  read  old  data  from  disk. 

SUBROUT  I NE  OPENU  (ERROR , OR  I VE , F I LNUM , BLOCK , NUOROS , DATA , AGE ) re  t ur  ne 

disk  addressing  Information  and  associated  pertinent  information  for 
a given  file  number.  Used  to  write  new  data  on  disk. 
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?that  discriminability  could  be  measured  as  functions  of  center  frequency,  band-  I 
width  and  delay.  The  stimuli  used  in  the  tests  included  an  inpulse,  phonemes, 
words,  and  a sentence. 

Sensitivity  to  dispersion  in  impulses  was  shown  to  be  dependent  on  inten- 
sity, center  frequency,  bandwidth,  and  delay.  Discrimination  scores  half  way 
between  chance  and  perfect  performance  were  achieved  for  dispersed  impulses  with 
delayed  bands  of  frequency  components  centered  between  250  and  500  and 

delayed  between  0.0625  and  0.125  milliseconds.  Sensation  level  of  the'"’ stimulus 
pairs  was  80  dB.  The  effect  of  bandwidth  on  discriminability  appears  to  reach  a 
plateau  at  100  Hertz ,lwhich  is  to  say  that*,  bandwidths  of  interest  are  less  than 
100  H&’tfzT  '•  

"The'  neutral  vowel,  the  plosive  /t/,  and  the  fricative  /f/  were  used  as 
stimuli  in  experiments  in  which  phonemes  served  as  the  standard  stimuli.  The 
stimulus  pairs  were  presented  at  levels  approximating  normal  conversational 
level  speech,  i.e.,  70,  32,  and  35  dB  Sensation  Level  respectively.  Discrimin- 
ability was  observed  to  be  strongly  dependent  on  bandwidth  and  delay.  A very 
slight,  if  any,  decrease  in  discriminability  was  observed  with  increased  center 
frequency.  Rank  ordering  of  the  phonemes  in  decreasing  sensitivity  to  disper- 
sion is  plosive,  vowel,  and  fricative.  For  bandwidths  of  approximately  100  to 
400  Hertz,  the  plosive  requires  4 to  8,  the  vowel  8 to  16,  and  the  fricative  16 
to  32  milliseconds  of  delay.  For  wider  bandwidths,  the  respective  ranges  are  2 
to  4,  2 to  4,  and  4 to  8 milliseconds.  More  explicitly,  for  narrower  bandwidths, 
the  plosive  requires  4 to  8 milliseconds  for  100  to  200  Hertz  bandwidth,  the 
vowel  8 to  16  milliseconds  for  100  to  400  Hertz  bandwidth,  and  the  fricative  16 
to  32  milliseconds  for  100  to  400  Hertz  bandwidth.  For  wider  bandwidths  the 
plosive  requires  2 to  4 milliseconds  for  200  to  400  Hertz  bandwidth,  the  vowel 
2 to  4 milliseconds  for  200  to  800  Hertz  bandwidth,  and  the  fricative  4 to  8 
milliseconds  for  400  to  800  Hertz  bandwidth.  Less  delay  is  required  for  greater 
bandwidth  since  discriminability  is  directly  dependent  on  both  independent 
variables. 

Discriminability  scores  obtained  from  tests  using  words  and  a sentence  as 
standard  stimuli  agree  with  the  scores  obtained  from  tests  using  phonemes. 
Discriminability  half  way  between  chance  and  perfect  performance  was  obtained 
for  the  dispersed  sentence  in  which  a 400  Hertz  band  of  components  centered  at 
500  Hertz  was  delayed  between  4 and  8 milliseconds.  The  score  agrees  with  that 
for  a vowel  which  implies  that  cues  to  dispersed  sound  quality  in  continuous 
speech  arises  from  vowel  dispersions. 

The  results  of  the  study  indicates  that  speech  processing  systems  intro- 
ducing no  more  than  a few  milliseconds  of  dispersion  will  cause  little  detriment 
to  the  speech  quality.  This  conclusion  is  significant  inasmuch  as  it  is 
difficult,  if  not  impossible,  to  determine  the  phase  of  many  speech  processing 
systems . 
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